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INTERNATIONAL PRELIMINARY 

EXAMINATION REPORT International application No. PCT/EP 03A)3287 



I. Basis of the report 

1 . With regard to the elements of the international application (Replacement sheets which have been furnished to 
the receiving Office in response to an invitation under Article 14 are referred to in this report as "originally filed" 
and are not annexed to this report since they do not contain amendments (Rules 70. 16 and 70. 1 7)): 



Description, Pages 

1 -85 as originally filed 
Claims, Numbers 

1 -47 received on 1 3.02.2004 with letter of 1 3.02.2004 
Drawings, Sheets 

1/7-7/7 as originally filed 

2. With regard to the language, all the elements marked above were available or furnished to this Authority in the 
language in which the international application was filed, unless otherwise indicated under this item. 

These elements were available or furnished to this Authority in the following language: , which is: 

□ the language of a translation furnished for the purposes ov >':=. <■ ^ International search (under Rule 23.1 (b)). % 

□ the language of publication of the international application (under Rule 48.3(b)). 

□ the language of a translation furnished for the purposes of international preliminary examination (under 
Rule 55.2 and/or 55.3). 

3. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the 
international preliminary examination was carried out on the basis of the sequence listing: 

□ contained in the international application in written form. 

□ filed together with the international application in computer readable form. 
El furnished subsequently to this Authority in written form. 

furnished subsequently to this Authority in computer readable form. 

El The statement that the subsequently furnished written sequence listing does not go beyond the disclosure 
in the international application as filed has been furnished. 

The statement that the information recorded in computer readable form is identical to the written sequence 
listing has been furnished. 

4. The amendments have resulted in the cancellation of: 

□ the description, pages: 

□ the claims, Nos.: 

□ the drawings, sheets: 
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5. □ This report has been established as if (some of) the amendments had not been made, since they have 

been considered to go beyond the disclosure as filed (Rule 70.2(c)). 

(Any replacement sheet containing such amendments must be referred to under item 1 and annexed to this 
report.) 

6. Additional observations, if necessary: 

V. Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability- 
citations and explanations supporting such statement 



Statement 
Novelty (N) 

Inventive step (IS) 

Industrial applicability (I A) 



Yes: 
No: 



Claims 
Claims 



Yes: Claims 

No: Claims 

Yes: Claims 

No: Claims 



1-8 
9-47 

1-47 
1-47 



2. Citations and explanations 
see separate sheet 
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Re Item V 

Reasoned statement with regard to novelty, inventive step or industrial 
applicability; citations and explanations supporting such statement 

Reference is made to the following documents: 

D1 : STAGLJAR I ET AL: "A genetic system based on split-uniquitin for the 
analysis of interactions between membrane proteins in vivo" 
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF USA, 
NATIONAL ACADEMY OF SCIENCE. WASHINGTON, US, vol. 95, April 
1998 (1998-04), pages 5187-5192, XP002171779 ISSN: 0027-8424 

D2: DUNNWALD M ET AL: "Detection of transient in vivo interactions between 
substrate and transporter during protein translocation into the endoplasmic 
reticulum." MOLECULAR BIOLOGY OF THE CELL. UNITED STATES FEB 

1999, vol. 10, no. 2, February 1999 (1999-02), pages 329-344, 
XP008006622 ISSN: 1059-1524 

D3: WITTKE SANDRA ET AL: "Probing the molecular environment of membrane 
proteins in vivo." MOLECULAR BIOLOGY OF THE CELL, vol. 10, no. 8, 
August 1999 (1999-08), pages 2519-2530, XP008006621 ISSN: 1059-1524 

D4: STAGLJAR I AND TE HEESEN S: "Detecting interactions between 

membrane proteins in vivo using chimeras. In: Methods in Enzymology: 
Applications of chimeric genes and hybrid proteins, Part B" METHODS IN 
ENZYMOLOGY, ACADEMIC PRESS INC, SAN DIEGO, CA, US, vol. 327, 

2000, pages 1 90-1 98, XP0021 71 780 ISSN: 0076-6879 

D5: LASER HEIKE ET AL: "A new screen for protein interactions reveals that the 
Saccharomyces cerevisiae high mobility group proteins Nhp6A/B are 
involved in the regulation of the GAL1 promoter." PROCEEDINGS OF THE 
NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES, vol. 97, 
no. 25, 5 December 2000 (2000-12-05), pages 13732-13737, XP001 074698 
December 5, 2000 ISSN: 0027-8424 

D6: WO 02/27020 A (STAGLJAR IGOR ;UNIV ZUERICH (CH); BUERKI 
CHRISTINE (CH)) 4 April 2002 (2002-04-04) 

1 Novelty 

1.1 D1 discloses a method for detection of interaction between membrane proteins 
using a split-ubiquitin system that can be used in vivo (cf. D1, p. 5187, right-hand 
col., para. 2, fig. 2). The split-ubiquitin system is specifically designed to measure 
interaction of proteins in membranes employing a transcription factor (A-LexA- 
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VP16) that can be cleaved off the ubiquitin part and act as an activator for a 
reporter gene. D2-D5 disclose similar methods all using the split-ubiquitin system 
(D2, see abstract, fig. 1 ; D3, see page 2521 , right-hand col. to p. 2523, right- 
hand col., para. 1; D4, see abstract, figure 1; D5, see abstract, p. 13737, left-hand 
col., last para, to right-hand col., last para.). D1-D6 disclose all the features of 
claims 9-47 in combination thereby also anticipating the "kit" of 9-17. Accordingly, 
claims 9-47 are not novel. 

1 .2 None of these documents discloses the use of two bait vectors having a first 
chimeric gene under the control of a strong and a weak promoter respectively for 
use in a parallel procedure (see page 43, lines 11-13). Accordingly, claims 1-8 are 
novel over the prior art cited. 

1 .3 D6 (see p. 14, line 20 to p. 1 9) discloses a further method using such a system. 
However, D6 was filed before the priority of the present application but was 
published to late. This document could become relevant in the regional phase for 
novelty. 

2 Inventive step 

2.1 D1 is considered to represent the closest prior art. It discloses a method for 

detection of interaction between membrane proteins using a split-ubiquitin system 
that can be used in vivo (cf. D1, p. 5187, right-hand col., para. 2, fig. 2). The split- 
ubiquitin system is specifically designed to measure interaction of proteins in 
membranes employing a transcription factor (A-LexA- VP16) that can be cleaved 
off the ubiquitin part and act as an activator for a reporter gene. The difference 
between subject-matter of claim 1-8 and D1 is the use of two variants of a bait 
vector carrying a first chimeric gene which is under control of a strong and a weak 
promoter respectively. There seems to be no technical effect associated with this 
technical difference. The objective problem which claims 1-8 want to overcome 
can therefor be formulated as being "How to provide an alternative system for 
detecting interaction between a first membrane bound protein and a second 
protein?". The solution which claims 1-8 provide is considered to be obvious since 
the technical difference in light of the prior art seems to have no effect on the 
method. 

3 Subject-matter of all claims is industrially applicable. 
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4 Remarks: 

4.1 The intelligibility of the claims could be improved if the "two-part form" were used 
in accordance with Rule 6.3(b) PCT, which in the present case would be 
appropriate, with those features known in combination from the prior art 
(documents D1-D5) being placed in the preamble (Rule 6.3(b)(i) PCT and with the 
remaining features being included in the characterising part (Rule 6.3(b)(ii)) PCT. 

4.2 The term "and a promoter" as presently used in claim 9(b) is unclear (Article 6 
PCT) in that it is not mentioned which genes are under control of this promoter. 

4.3 The term "a first [or second] protein sequence involved in intracellular protein 
degradation" is not fully supported in the application (Article 6 PCT) as there is 
disclosure only for the "split-ubiquitin" system (Article 5 PCT). 

4.4 The method of "providing a compound" (claim 47) does not fulfil the requirements 
of Article 6 PCT as the claim does not define the subject-matter for which 
protection is sought. The person skilled in the art would not understand which 
compounds could be provided by this method as thove h: no structural definition of 
such compounds. 
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CLAIMS 

1. A method for detecting an interaction between a first membrane bound protein or 
part thereof and a second protein or part thereof which is either membrane bound 
or soluble, the method comprising: 

(a) providing a host cell containing at least one detectable gene (reporter 
gene) having a binding site for a transcriptional activator, such that the 
detectable gene expresses a detectable product, preferably a protein, 
when the detectable gene is transcriptionally activated; 

(b) (i) providing, as part of a first bait vector, a first chimeric gene under the 
control of a strong ?f<;moter, and . 

(ii) providing, as part of a second bait vector, a first chimeric gene under 
the control of a weak promoter, 

said first chimeric gene being capable of being expressed in said host cell 
and coding inter alia for a first membrane protein or part thereof which 
gene is attached to the DNA-sequence of a first module encoding inter alia 
a first protein sequence involved in intracellular protein degradation and a 
transcriptional activator, said first protein or part thereof to be tested 
whether it can interact with a second protein or part thereof; 

(c) providing, as part of a prey vector, a second chimeric gene under the 
control of a strong promoter capable of being expressed in said host cell, 
the second chimeric gene coding inter alia for a second protein or part 
thereof which is either membrane bound or soluble and which gene is 
attached to the DNA sequence of a second module encoding inter alia a 
second protein sequence involved in intracellular protein degradation; 

(d) introducing the first repectively the second bait vector and the prey vector 
into the host cell such that an interaction between the expressed first and 
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second protein and/or their parts can take place, which interaction leads to 
an interaction of the first protein sequence of the first module and the 
second protein sequence of the second module which interaction in turn 
leads to activation of an intracellular protease and proteolytic separation of 
the transcriptional activator, at least one of the bait vector and prey vector, 
preferably both, being suitable for being maintained episomally; 

(e) determining whether the detectable gene of the host cell has been 
activated by the transcriptional activator. 

The method according to claim 1, wherein the host cell is a yeast, a bacterial or a 
mammalian cell. 

The method according to claim 2, wherein the host cell is a Saccharomyces 
pombe cell or, more preferably, cells of the budding yeast Saccharomyces 
cerevisiae. 

The method according to any of claims 1 to 3, wherein the detectable gene is 
•vc^rted by a natural or artificial activator, preferably an activator comprising a 
sf?ert tagging module. 

The method according to any of claims 1 to 4, wherein the detectable gene is by 
the artificial transcriptional activator protein A-LexA-V16 (PLV). 

The method according to any of claims 1 to 5, wherein the first protein sequence 
comprises a C-terminal portion of ubiquitin (Cub) or a mutant thereof (CbM) and 
the second protein sequence comprises an N-terminal portion of ubiquitin (Nub) 
or a mutant thereof (NbM). 

The method according to any of claims 1 to 6, wherein the DNA-sequence coding 
for the first membrane protein is selected from the group consisting of any bacte- 
rial membrane protein, any viral membrane protein, any oncogene-encoded 
membrane protein, any growth factor receptor or any eukaryotic membrane pro- 
tein, or parts thereof. 

The method according to any one of the preceding claims wherein the second 
membrane protein or the soluble protein, or part thereof, is encoded by a library: 
plasmids. 
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9. A kit for detecting binding between a first membrane bound protein or part thereof 
a second protein or part thereof which is either membrane bound or soluble 
comprising: 



(a) a host cell containing at least one detectable gene (reporter gene) having 
a binding site for a transcriptional activator, such that the detectable gene 
expresses a detectable product, preferably a protein, when the detectable 
gene is transcriptionally activated; 

(b) a first vector (bait) comprising a first site capable of receiving a first nucleic 
acid coding for a first membrane protein or part thereof such that when the 
first nucleic acid is inserted it becomes attached to the DNA sequence of a 
first module encoding inter afia a first protein sequence involved in 
intracellular protein degradation, the module further comprising a nucleic 
acid for a transcriptional activator and a promoter; 

(c) a second vector (prey) comprising a second site capable of receiving a se- 
cond nucleic acid coding for a second membrane protein or a soluble pro- 
tein or part thereof such that when the second nucleic acid is inserted W 
becomes attached to the DNA sequence of a second module encoding 
inter alia a sequence protein sequence involved in intracellular protein 
degradation, wherein the module further comprises a promoter; and 
optionally 



(d) a plasmid library encoding second proteins or parts thereof. 



10. The kit according to claim 9, wherein the host cell is a yeast, bacterial or mamma- 
lian cell. 

11. The kit according to claim 10 f wherein the host cell is a yeast cell, preferably of 
Saccharomyces pombe and most preferably of the budding yeast 
Saccharomyces cerevisiae. 

12. The kit according to any of claims 9 to 11, wherein the detectable gene can be 
activated by a natural or artificial activator, preferably an activator comprising a 
short tagging module. 
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13. The kit according to any of claims 9 to 12, wherein the detectable gene can be 
activated by the artificial transcriptional activator protein A-LexA-V16 (PLV). 

14. The kit according to any of claims 9 to 13, wherein the first protein sequence 
contains the C-terminal part of ubiquitin (Cub) or a mutant thereof and the second 
protein sequence contains the N-Terminal part of ubiquitin (Nub) or a mutant the- 
reof. 

15. The kit according to any of claims 9 to 14, wherein the promoter is selected from 
the group consisting of ADH promoter, CYC1 promoter or TEF1 promoter. 

16. The kit according to any of claims 9 to 15, wherein the DNA-sequence coding for 
the first membrane protein is derived from any bacterial membrane protein, any 
viral membrane protein, any oncogene-encoded membrane protein, any growth 
factor receptor or any eukaryotic membrane protein, or parts thereof. 

17. The kit according to any of claims 9 to 16 wherein the DNA-sequence coding for 
the second protein is contained in a library of plasmids. 

18. A vector useful as a bait vector in a method according to ©r*y of-claims 1 to 8 or a 
kit according to any of claims 9 to Comprising the following elements: 

(1 ) a selection marker for propagation of the vector in E. coii; 

(2) an origin of replication which allows propagation of the vector in E. coli; 

(3) a further selection marker for propagation of the vector in yeast; 

(4) an origin of replication which allows propagation of the vector in yeast; and 

(5) an expression cassette comprising the following elements: 

(a) a promoter element conferring low level expression or inducible 
expression or high level expression; 

(b) a nucleic acid sequence encoding a leader; 

(c) a site capable of receiving a nucleic acid encoding a protein; 
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(d) a nucleic acid sequence encoding the C-terminal part of yeast ubiquitin 
(Cub) or mutated yeast ubiquitin (CbM); 

(e) a nucleic acid sequence encoding a tag that allows detection of the fusion 
protein; 

(f) a nucleic acid sequence encoding a DNA binding protein; 

(g) a nucleic acid sequence encoding a transcriptional activator; and (h) a 
terminator sequence. 

The vector according to claim 18, wherein the selection marker according to 
element (1) comprises a marker selected from the ampicilline resistance gene, 
the kanamycine resistance gene or the chloramphenicol resistance gene. 

The vector according to any of claims 18 or 19, wherein the origin of replication 
according to element (2) is selected from a pUC based or pBR322 origin of 
replication. 

The vector according to any tff claims 18 to 20, wherein the selection marker 
according to element (3) is selected from any marker that is selectable in S. 
cerevisiae, preferably the LEU2 gene. 

The vector according to any of claims 18 to 21, wherein the origin of replication 
according to element (4) is a low copy number origin, preferably selected from 
CEN and ARS origins. 

The vector according to any of claims 18 to 22, wherein the promoter according 
to element (5a) is selected from CYC1, GAL1, CUP1. ADH1 or TEFL 

The vector according to any of claims 18 to 23, wherein the nucleic acid sequence 
encoding a leader according to element (5b) is selected from a signal sequence 
derived from a yeast integral membrane protein or a signal sequence which 
confers fatty acid modification preferably to the following peptide (N- 
MGCTLSAEDKPGGP-C). 

The vector according to any of claims 18 to 24, wherein the site for receiving a 
nucleic acid sequence according to feature (5c) is a multiple cloning site or a se- 
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quence allowing the insertion of a further nucleic acid sequence by in vivo recom- 
bination. 

26. The vector according to any of claims 18 to 25, wherein the nucleic acid 
sequence to element (5d) encodes the C-terminal part of yeast ubiquitin from 
amino acids 35 to 76. 

27. The vector according to any of claims 18 to 26, wherein the tag for detecting the: 
expression of the polypeptide fusion is selected from epitopes of Gal4, LexA, B42 
or VP16 or 3x FLAG, 3x MYC or HA epitopes or the hemagglutinin epitopes. 

28. The vector according to any of claims 18 to 27, wherein the nucleic acid encoding 
the DNA binding protein according to element (5f) is selected from the sequence 
encoding the bacterial LexA protein, the sequence encoding the yeast Gal4 pro- 
tein, more preferably amino acids 1 to 93, most preferably the sequence 
encoding amino acids 1 to 74 of the Gal4 protein. 

29. The vector according to any of claims 18 to 28, wherein the transcriptional 
actiwfer according to element (5g) is selected from the Herpes simplex vims 
protaiti VP16 or more preferably the acidic domain B42. 

30. The vector according to any of claims 18 to 29, wherein the terminator sequence 
according to element (5h) is selected from the CYC1 or ADHI terminator 
sequence 

31 The vector according to any of claims 18 to 30, wherein the vector further 
contains a coding for a first membrane bound protein or part thereof (the bait pro- 
tein). 

32. A vector suitable as a prey vector in a method according to any of claims 1 to 8 or 
a kit according to any of claims 9 to 17 comprising the following elements: 

(1 ) a selection marker for propagation of the vector in E. coli; 

(2) an origin of replication which allows propagation of the vector in E. coli, 

(3) a selection marker for propagation of the vector in yeast; 
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(4) an origin of replication which allows propagation of the vector in yeast; and 

(5) an expression cassette comprising the following elements: 

(a) a promoter element conferring low level expression or inducible ex- 
pression or high level expression; 

(b) an open reading frame encoding the N-terminal part of yeast ubiquitin 
(Nub) or a mutated ubiquitin (NbM); 

(c) a nucleic acid sequence encoding a tag which allows detection of the 
expressed polypeptide fusion; 

(d) a site capable of receiving a nucleic acid sequence encoding a protein; 
and 

(e) a terminator sequence. 

33. The vector according to claim 32, wherein the selection marker according to ele- 
ment (1) is selected from the ampicilline resistance gene, the kanamycine re- 
sistance gene and the chloramphenicol resistance gene. 

34. The vector according to any of claims 32 or 33, wherein the origin of replication 
according to element (2) is selected from an origin such as pUC based or 
pBR322 based origin or preferably an origin of replication allowing a high copy 
number of the plasmid when propagated in E. coli. 

35. The vector according to any of claims 32 to 34, wherein the selection marker ac- 
cording to element (3) is selected from a marker that is selectable in S. 
cerevisiae, preferably a sequence encoding theTRPI gene. 

36. The vector according to any of claims 32 to 35, wherein the origin of replication 
according to element (4) is selected from an origin which allows propagation of 
the vector is S. cerevisiae, preferably the two micron based or CEN/ARS origin of 
replication or more preferably an origin of replication that allows a high copy 
number in S. cerevisiae, preferably the two micron origin of replication. 



AMENDED SHEET 



8 



37. The vector according to any of claims 32 to 36, wherein the promoter element ac- 
cording to element (5a) is selected from CYC1, GALL CUP1, ADH1 or TEF-I. 

38. The vector according to any of claims 32 to 37, wherein the open reading frame 
according to element (5b) encodes a part of the yeast ubiquitin comprising amino 
acids 1 to 37 of the wild-type or said part bearing an amino acid exchange at 
either position 3 or position 13 or both, wherein the amino acid replacing the ori- 
ginal amino acid is preferably selected from leu, val, ala or gly. 

39. The vector according to any of claims 32 to 38, wherein the tag is selected from 
the 3x FLAG epitope, 3x MYC epitope, HA epitopes or an hemagglutinin epitope. 

40. The vector according to any of claims 32 to 39, wherein the site capable of 
receiving a further sequence according to element (5d) is selected from a multiple 
cloning site or a sequence allowing insertion of the further sequence by in vivo re- 
combination. 



41. The vector according to any of claims 32 to 40, wherein the terminator sequence 
is selected from yeast ADH1 orCYCI terminators. 

42. The vector according to any of claims 32 to 41, wherein the vector contains a 
further sequence encoding a protein or part thereof (the prey protein) which pro- 
tein is either membrane bound or soluble. 

43. Host cell containing the vectors and/or plasmids as defined in any of claims 9 to 
42. 

44. Use of the kit according to any of claims 9 to 1 7 or of the vector according to any 
of claims 18 to 41 in a process for detecting interaction between a first membrane 
bound protein or part thereof and a second protein or part thereof which is either 
membrane bound or soluble. 

45. Use of the host cell according to claim 43 in a method for detecting an interaction 
between a first membrane bound protein or part thereof and a second protein or 
part thereof which is either membrane bound or soluble. 
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46. Use of the kit according to any of claims 9 to 17 or the vector according to any of 
claims 18 to 41 or the host cell of claim 43 in a screening process for identifying 
pharmaceutical drugs. 

47. A method for providing a compound capable of interfering with protein/protein 
interaction comprising the following screening steps: 

(1) providing a host cell according to claim 43, the bait and prey polypeptides; 
being; selected such that they interact when expressed. 

(2) incubating said host cell in the presence and absence of the compound(s) 
to be tested; 

(3) measuring the difference in reporter gene expression between the 
incubation containing the compound(s) to be tested and the incubation 
free of the compound(s) to be tested; and optionally 

(4) purifying or synthesizing the compound positively tested with the 
screen. 
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CLAIMS 

1 . A method for detecting an interaction between a first membrane bound protein or 
part thereof and a second protein or part thereof which is either membrane bound 
or soluble, the method comprising: 

(a) providing a host cell containing at least one detectable gene (reporter 
gene) having a binding site for a transcriptional activator, such that the 
detectable gene expresses a detectable product, preferably a protein, 
when the detectable gene is transcriptionally activated; 

(b) (i) providing, as part of a first bait vector, a first chimeric gene under the 
control of a strong promoter, and 

(ii) providing, as part of a second bait vector, a first chimeric gene under 
the control of a weak promoter, 

said first chimeric gene being capable of being expressed in said host cell 
and coding inter alia for a first membrane protein or part thereof which 
gene is attached to the DNA-sequence of a first module encoding inter alia 
a first protein sequence involved in intracellular protein degradation and a 
transcriptional activator, said first protein or part thereof to be tested 
whether it can interact with a second protein or part thereof; 

(c) providing, as part of a prey vector, a second chimeric gene under the 
control of a strong promoter capable of being expressed in said host cell, 
the second chimeric gene coding inter alia for a second protein or part 
thereof which is either membrane bound or soluble and which gene is 
attached to the DNA sequence of a second module encoding inter alia a 
second protein sequence involved in intracellular protein degradation; 

(d) introducing the first repectively the second bait vector and the prey vector 
into the host cell such that an interaction between the expressed first and 
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second protein and/or their parts can take place, which interaction leads to 
an interaction of the first protein sequence of the first module and the 
second protein sequence of the second module which interaction in turn 
leads to activation of an intracellular protease and proteolytic separation of 
the transcriptional activator, at least one of the bait vector and prey vector, 
preferably both, being suitable for being maintained episomally; 

(e) determining whether the detectable gene of the host cell has been 
activated by the transcriptional activator. 

2. The method according to claim 1, wherein the host cell is a yeast, a bacterial or a 
mammalian cell. 

3. The method according to claim 2, wherein the host cell is a Saccharomyces 
pombe cell or, more preferably, cells of the budding yeast Saccharomyces 
cerevisiae. 

4. The method according to any of claims 1 to 3, wherein the detectable gene is 
activated by a natural or artificial activator, preferably an activator comprising a 
short tagging module. 

5. The method according to any of claims 1 to 4, wherein the detectable gene is by 
the artificial transcriptional activator protein A-LexA-V16 (PLV). 

6. The method according to any of claims 1 to 5, wherein the first protein sequence 
comprises a C-terminal portion of ubiquitin (Cub) or a mutant thereof (CbM) and 
the second protein sequence comprises an N-terminal portion of ubiquitin (Nub) 
or a mutant thereof (NbM). 

7. The method according to any of claims 1 to 6, wherein the DNA-sequence coding 
for the first membrane protein is selected from the group consisting of any bacte- 
rial membrane protein, any viral membrane protein, any oncogene-encoded 
membrane protein, any growth factor receptor or any eukaryotic membrane pro- 
tein, or parts thereof. 

8. The method according to any one of the preceding claims wherein the second 
membrane protein or the soluble protein, or part thereof, is encoded by a library: 
plasmids. 
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9. A kit for detecting binding between a first membrane bound protein or part thereof 
a second protein or part thereof which is either membrane bound or soluble 
comprising: 

(a) a host cell containing at least one detectable gene (reporter gene) having 
a binding site for a transcriptional activator, such that the detectable gene 
expresses a detectable product, preferably a protein, when the detectable 
gene is transcriptionally activated; 

(b) a first vector (bait) comprising a first site capable of receiving a first nucleic 
acid coding for a first membrane protein or part thereof such that when the 
first nucleic acid is inserted it becomes attached to the DNA sequence of a 
first module encoding inter alia a first protein sequence involved in 
intracellular protein degradation, the module further comprising a nucleic 
acid for a transcriptional activator and a promoter; 

(c) a second vector (prey) comprising a second site capable of receiving a se- 
cond nucleic acid coding for a second membrane protein or a soluble pro- 
tein or part thereof such that when the second nucleic acid is inserted it 
becomes attached to the DNA sequence of a second module encoding 
inter alia a sequence protein sequence involved in intracellular protein 
degradation, wherein the module further comprises a promoter; and 
optionally 

(d) a plasmid library encoding second proteins or parts thereof. 

10. The kit according to claim 9, wherein the host cell is a yeast, bacterial or mamma- 
lian cell. 

11. The kit according to claim 10, wherein the host cell is a yeast cell, preferably of 
Saccharomyces pombe and most preferably of the budding yeast 
Saccharomyces cerevisiae. 

12. The kit according to any of claims 9 to 11, wherein the detectable gene can be 
activated by a natural or artificial activator, preferably an activator comprising a 
short tagging module. 
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13. The kit according to any of claims 9 to 12, wherein the detectable gene can be 
activated by the artificial transcriptional activator protein A-LexA-V16 (PLV). 

14. The kit according to any of claims 9 to 13, wherein the first protein sequence 
contains the C-terminal part of ubiquitin (Cub) or a mutant thereof and the second 
protein sequence contains the N-Terminal part of ubiquitin (Nub) or a mutant the- 
reof. 



15. The kit according to any of claims 9 to 14, wherein the promoter is selected from 
the group consisting of ADH promoter, CYC1 promoter or TEF1 promoter. 

16. The kit according to any of claims 9 to 15, wherein the DNA-sequence coding for 

the first membrane protein is derived from any bacterial membrane protein, any 
viral membrane protein, any oncogene-encoded membrane protein, any growth 
factor receptor or any eukaryotic membrane protein, or parts thereof. 



17. The kit according to any of claims 9 to 16 wherein the DNA-sequence coding for 
the second protein is contained in a library of plasmids. 

18. A vector useful as a bait vector in a method according to any of claims 1 to 8 or a 
kit according to any of claims 9 to Comprising the following elements: 



(1 ) a selection marker for propagation of the vector in E. coli; 

(2) an origin of replication which allows propagation of the vector in E. coli; 

(3) a further selection marker for propagation of the vector in yeast; 

(4) an origin of replication which allows propagation of the vector in yeast; and 

(5) an expression cassette comprising the following elements: 

(a) a promoter element conferring low level expression or inducible 
expression or high level expression; 



(b) a nucleic acid sequence encoding a leader; 



(c) a site capable of receiving a nucleic acid encoding a protein; 



AMENDED SHEET 



)2-2004 



5 



EP0303287 



(d) a nucleic acid sequence encoding the C-terminal part of yeast ubiquitin 
(Cub) or mutated yeast ubiquitin (CbM); 

(e) a nucleic acid sequence encoding a tag that allows detection of the fusion 
protein; 

(f) a nucleic acid sequence encoding a DNA binding protein; 

(g) a nucleic acid sequence encoding a transcriptional activator; and (h) a 
terminator sequence. 

19. The vector according to claim 18, wherein the selection marker according to 
element (1) comprises a marker selected from the ampicilline resistance gene, 
the kanamycine resistance gene or the chloramphenicol resistance gene. 

20. The vector according to any of claims 18 or 19, wherein the origin of replication 
according to element (2) is selected from a pUC based or pBR322 origin of 
replication. 

21. The vector according to any of claims 18 to 20, wherein the selection marker 
according to element (3) is selected from any marker that is selectable in S. 
cerevisiae, preferably the LEU2 gene. 

22. The vector according to any of claims 18 to 21, wherein the origin of replication 
according to element (4) is a low copy number origin, preferably selected from 
CEN and ARS origins. 

23. The vector according to any of claims 18 to 22, wherein the promoter according 
to element (5a) is selected from CYC1, GAL1, CUP1, ADH1 or TEFL 

24. The vector according to any of claims 18 to 23, wherein the nucleic acid sequence 

encoding a leader according to element (5b) is selected from a signal sequence 
derived from a yeast integral membrane protein or a signal sequence which 
confers fatty acid modification preferably to the following peptide (N- 
MGCTLSAEDKPGGP-C). 

25. The vector according to any of claims 18 to 24, wherein the site for receiving a 
nucleic acid sequence according to feature (5c) is a multiple cloning site or a se- 
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quence allowing the insertion of a further nucleic acid sequence by in vivo recom- 
bination. 

26. The vector according to any of claims 18 to 25, wherein the nucleic acid 
sequence to element (5d) encodes the C-terminal part of yeast ubiquitin from 
amino acids 35 to 76. 

27. The vector according to any of claims 18 to 26, wherein the tag for detecting the: 

expression of the polypeptide fusion is selected from epitopes of Gal4, LexA, B42 
or VP16 or 3x FLAG, 3x MYC or HA epitopes or the hemagglutinin epitopes. 

28. The vector according to any of claims 18 to 27, wherein the nucleic acid encoding 
the DNA binding protein according to element (5f) is selected from the sequence 
encoding the bacterial LexA protein, the sequence encoding the yeast Gal4 pro- 
tein, more preferably amino acids 1 to 93, most preferably the sequence 
encoding amino acids 1 to 74 of the Gal4 protein. 

29. The vector according to any of claims 18 to 28, wherein the transcriptional 
activator according to element (5g) is selected from the Herpes simplex virus 
protein VP16 or more preferably the acidic domain B42. 

30. The vector according to any of claims 18 to 29, wherein the terminator sequence 
according to element (5h) is selected from the CYC1 or ADHI terminator 
sequence 

31. The vector according to any of claims 18 to 30, wherein the vector further 
contains a coding for a first membrane bound protein or part thereof (the bait pro- 
tein). 

32. A vector suitable as a prey vector in a method according to any of claims 1 to 8 or 
a kit according to any of claims 9 to 17 comprising the following elements: 

(1) a selection marker for propagation of the vector in E. coli; 

(2) an origin of replication which allows propagation of the vector in E. coli, 

(3) a selection marker for propagation of the vector in yeast; 
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(4) an origin of replication which allows propagation of the vector in yeast; and 

(5) an expression cassette comprising the following elements: 

(a) a promoter element conferring low level expression or inducible ex- 
pression or high level expression; 

(b) an open reading frame encoding the N-terminal part of yeast ubiquitin 
(Nub) or a mutated ubiquitin (NbM); 

(c) a nucleic acid sequence encoding a tag which allows detection of the 
expressed polypeptide fusion; 

(d) a site capable of receiving a nucleic acid sequence encoding a protein; 
and 

(e) a terminator sequence. 

33. The vector according to claim 32, wherein the selection marker according to ele- 
ment (1) is selected from the ampicilline resistance gene, the kanamycine re- 
sistance gene and the chloramphenicol resistance gene. 

34. The vector according to any of claims 32 or 33, wherein the origin of replication 
according to element (2) is selected from an origin such as pUC based or 
pBR322 based origin or preferably an origin of replication allowing a high copy 
number of the plasmid when propagated in E. coli. 

35. The vector according to any of claims 32 to 34, wherein the selection marker ac- 
cording to element (3) is selected from a marker that is selectable in S. 
cerevisiae, preferably a sequence encoding the TRP1 gene. 

36. The vector according to any of claims 32 to 35, wherein the origin of replication 
according to element (4) is selected from an origin which allows propagation of 
the vector is S. cerevisiae, preferably the two micron based or CEN/ARS origin of 
replication or more preferably an origin of replication that allows a high copy 
number in S. cerevisiae, preferably the two micron origin of replication. 
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37. The vector according to any of claims 32 to 36, wherein the promoter element ac- 
cording to element (5a) is selected from CYC1 , GAL1 , CUP1 , ADH1 or TEF-I. 

38. The vector according to any of claims 32 to 37, wherein the open reading frame 
according to element (5b) encodes a part of the yeast ubiquitin comprising amino 
acids 1 to 37 of the wild-type or said part bearing an amino acid exchange at 
either position 3 or position 13 or both, wherein the amino acid replacing the ori- 
ginal amino acid is preferably selected from leu, val, ala or gly. 

39. The vector according to any of claims 32 to 38, wherein the tag is selected from 
the 3x FLAG epitope, 3x MYC epitope, HA epitopes or an hemagglutinin epitope. 



40. The vector according to any of claims 32 to 39, wherein the site capable of 
receiving a further sequence according to element (5d) is selected from a multiple 
cloning site or a sequence allowing insertion of the further sequence by in vivo re- 
combination. 



41. The vector according to any of claims 32 to 40, wherein the terminator sequence 
is selected from yeast ADH1 or CYC1 terminators. 

42. The vector according to any of claims 32 to 41, wherein the vector contains a 
further sequence encoding a protein or part thereof (the prey protein) which pro- 
tein is either membrane bound or soluble. 



43. Host cell containing the vectors and/or plasmids as defined in any of claims 9 to 
42. 

44. Use of the kit according to any of claims 9 to 17 or of the vector according to any 
of claims 18 to 41 in a process for detecting interaction between a first membrane 
bound protein or part thereof and a second protein or part thereof which is either 
membrane bound or soluble. 



45. Use of the host cell according to claim 43 in a method for detecting an interaction 
between a first membrane bound protein or part thereof and a second protein or 
part thereof which is either membrane bound or soluble. 
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46. Use of the kit according to any of claims 9 to 17 or the vector according to any of 
claims 18 to 41 or the host cell of claim 43 in a screening process for identifying 
pharmaceutical drugs. 

47. A method for providing a compound capable of interfering with protein/protein 
interaction comprising the following screening steps: 

(1) providing a host cell according to claim 43, the bait and prey polypeptides; 
being; selected such that they interact when expressed. 

(2) incubating said host cell in the presence and absence of the compound(s) 
to be tested; 

(3) measuring the difference in reporter gene expression between the 
incubation containing the compound(s) to be tested and the incubation 
free of the compound(s) to be tested; and optionally 

(4) purifying or synthesizing the compound positively tested with the 
screen. 
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5 METHOD AND KIT FOR DETECTING MEMBRANE PROTEIN - PROTEIN 

INTERACTIONS 



The present invention is concerned with a method for detecting membrane protein - pro- 
10 tein interactions with an in vivo genetic system based in yeast, bacterial or mammalian 
cells. Furthermore, the invention provides a kit for detecting the interactions between a 
first membrane protein and a second protein using reconstitution of a protein involved in 
intracellular protein degradation such as the split ubiquitin protein. The reconstitution of 
split ubiquitin makes use of chimeric genes, which express hybrid proteins. 

An important area in biology is the analysis of interactions between prate?- v roteips are 
complex macromolecules made up of covalently linked chains of amino >vw;. Each pro- 
tein assumes a unique three-dimensional shape principally determined by its sequence of 
amino acids. Many proteins consist of smaller units called domains, which are continuous 
20 stretches of amino acids able to fold independently from the rest of the protein (e.g. im- 
munoglobulin domains, immunoglobulin-like domains, GTPase domains, SH2 and SH3 
domains). 

Interactions between proteins mediate most processes in a living cell. They are involved, 
25 for example, in the assembly of enzyme subunits, antigen-antibody reaction, in forming 
the supramolecular structures of ribosomes, filaments and viruses. A special and specific 
role can be attributed to membrane proteins. They are involved in the transport of mole- 
cules; in the interaction of receptors on the cell surface with growth factors and hor- 
mones; membrane-bound oncogene products can give rise to neoplastic transformation 
30 through protein-protein interactions with proteins called kinases whose enzymatic activity 
on cellular target proteins leads to a cancerous state. Other examples of a protein-protein 
interaction in membranes occur when a virus infects a cell by recognizing a protein (re- 
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ceptor) on the surface, and this interaction has been used to design antiviral agents. 
There are two types of transmembrane proteins: Type I transmembrane proteins have 
their C-terminus in the cytoplasm, whereas the Type II transmembrane proteins have 
their C-terminus outside the cell (or in the inner part of some other organelle, for example 
in the lumen of the endoplasmic reticulum). 

Protein-protein interactions have been generally studied in the past ten years by using 
biochemical techniques such as cross-linking, co-immunoprecipitation and co- 
fractionation by chromatography. Biochemical methods have the disadvantage that inter- 
acting proteins are generally known as bands of a particular mobility on a polyacrylamide 
gel. To progress from these bands to cloned genes is often a very tedious process. 

A genetic system that is capable of rapidly detecting which proteins interact with a known 
protein, determining which domains of the proteins interact, and providing the genes for 
the newly identified interacting proteins has been developed in 1989 by Stan Fields and 
Ok-Kyu Song [Fields, S. and Song, O.- K., Nature ?4Q, 245-248 (1989)]. Their system,, 
termed yeast two-hybrid system, is based on i:W*tfiution of a transcriptional activator 
and transcriptional activation of reporter genes. The yeast two-hybrid system is a power- 
ful method for the in vivo analysis of protein-protein interactions, but is naturally limited to 
the analysis of soluble proteins or soluble domains of membrane proteins, i.e. interactions 
between integral membrane proteins cannot be studied. In addition, the hybrid proteins 
are targeted to the nucleus where the interactions take place. Thus, the interactions that 
are dependent on post-translational modifications that take place within the endoplas- 
matic reticulum, such as glycosylate and disulfide bond formation, may not occur. 

The split-ubiquitin system represents an alternative assay for the in vivo analysis of pro- 
tein interactions. It was developed in 1994 by Nils Johnsson and Alexander Varshavsky 
[Johnsson, N. and Varshavsky, A., Proc. Natl. Acad. Sci. USA 91, 10340-10344 (1994)] 
for the detection of interactions between soluble proteins (Figure 1). Ubiquitin (Ub) is a 76 
amino acid residue, single domain protein that is present in cells either free or covalently 
linked to other proteins. Ubiquitin plays a role in a number of processes primarily through 
routes that involve protein degradation. In eukaryotes, newly formed Ub fusions are rap- 
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idly cleaved by ubiquitin specific proteases (UBPs) after the last residue of Ub at the Ub- 
polypeptide junction. The cleavage of a UB fusion by UBPs requires the folded conforma- 
tion of Ub. When a C-terminal fragment of ubiquitin (CbM) is expressed as a fusion to a 
reporter protein, the fusion is cleaved only if an N-terminal fragment of Ubiquitin (NbM) is 
also expressed in the same cell. This reconstitution of native ubiquitin from its fragments, 
detectable by the in vivo cleavage assay, is not observed with a mutationally altered 
NbM. However, if CbM and the altered NbM are each linked to polypeptides that interact 
in vivo, the cleavage of the fusion containing CbM is restored, yielding a generally appli- 
cable assay for detecting the protein-protein interactions (Figure 1 ). 

The system was subsequently modified and shown to work with membrane proteins [I. 
Stagljar et al., Proc. Natl. Acad. Sci. USA 95, p. 5187-5192, 1998)]. Three yeast mem- 
brane proteins of the endoplasmic reticulum have been used as a model system. Wbplp 
and Ostlp are both subunits of the oligosaccharyl transferase membrane protein com- 
plex. The Alg5 protein also localizes to the membrane of the endoplasmic reticulum, but 
does not interact with v . , Hgosaccharyltransferase. Specific interactions were detected v 
between Wbplp and CsMpy but not between Wbplp and Alg5p. Therefore, the modified , 
split-ubiquitin system works as a detection system for interactions involving membrane- 
associated proteins. In contrast to the conventional two-hybrid system, which requires 
nuclear localization, the interactions are detected in their natural environment of the pro- 
tein of interest. According to this prior art, the constructs used for establishing the split- 
ubiquitin system were designed for integration into the host genome. 

However, none of the aforementioned prior art suggests a genetic method that works for 
the detection of in vivo membrane protein - cytosol protein interactions as well as for 
membrane protein - membrane protein interactions using transcriptional activation as an 
assay. Moreover, the above prior art showed poor reproducibility since the signal 
strength obtained largely depended on the site of integration of the constructs into the 
host genome. 

It is therefore an object of the present invention to provide a method for detecting in vivo 
protein interactions of membrane proteins with membrane proteins as well as membrane 
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proteins with cytosolic (soluble) proteins which method is free of the weaknesses of the 
prior art. 

A further object of the present invention is to provide a method for the screening of li- 
braries for identifying molecules which may interact with a membrane-bound protein. 



Yet another object of the present invention is to provide a method which can be used to 
identify compounds that are capable of interfering with protein-protein interactions, 
wherein at least one protein of the interaction is a membrane bound protein. 
10 Yet another object of the present invention is to provide a method which can be used in 
the identification/design of peptides capable of interfering with protein-protein interac- 
tions, wherein at least one protein of the interaction is a membrane bound protein. 

Yet another object of the present invention is to provide a method which can be used in 
15 the identification/design of scFVs or antibodies that are capable of interfering with protein- 
riein interactions, wherein at least one protein of the interaction is a membrane bound 

protein. 

Yet another object of the present invention is to provide a method which can be used to 
20 identify compounds that bind to and activate G protein-coupled receptors (GPCRs) or that 
selectively bind to and activate one GPCR but not another GPCR that is coexpressed 
within the same cell. 



25 



30 



These and other objects are achieved by the present invention, which provides a method 
according to claim 1, a kit for detecting interactions between either two membrane pro- 
teins or one membrane and one cytosolic protein according to claim 9, and vectors ac- 
cording to claim 18 and 32. The method is based on the reconstitution of a protein in- 
volved in intracellular protein degradation, such as ubiquitin and which makes use of 
chimeric genes, which express hybrid proteins. Two types of hybrid proteins are pre- 
pared. The first hybrid contains a membrane protein of interest (bait) fused to e.g. the 
CbM-TDA module (containing C-terminal domain of ubiquitin (CbM) followed e.g. by an 
artificial transcriptional activator (e.g. LexM-B42)). The second hybrid protein (prey) con- 
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tains e.g. an N-terminal domain of ubiquitin (NbM) fused to the second test protein. The 
prey protein can be either a membrane protein or a soluble cytoplasmic protein. If two 
test proteins are able to interact, they reconstitute two separate ubiquitin domains into an 
active ubiquitin leading to the cleavage of the transcriptional activator and activation of 
5 the reporter system. 

One advantage of this method is that a multiplicity of membrane proteins can be simulta- 
neously tested to determine whether any interact with a known protein. For example, a 
DNA fragment encoding the membrane protein of interest (bait) is fused to a DNA frag- 

10 ment encoding the CbM-LexM-B42 fusion. This hybrid is introduced into the host cell 
(yeast, bacterial or mammalian cells) carrying (a) marker gene(s). For the second partner, 
(prey), a library of plasmids can be constructed which may include, for example, total 
human complementary DNA (cDNA) fused to the DNA sequence encoding the N-terminal 
domain of Ubiquitin (NbM). This library is introduced into the yeast cells carrying bait 

15 protein. If any individual plasmid from the library encodes a protein that is capable of in- 
teracting with the membrane bait protein, a positive signal will be obtained. In ^Ition, 
when an Interaction between proteins occurs, the gene for the newly identifier y,^n is 
available. 

20 The system further allows the identification of compounds which are capable of interfering 
with the protein/protein interaction as described above. In such an approach, the method 
according to the invention is performed in the presence and absence of a compound and 
compound library, respectively, and it is determined as to whether the presence of such a 
compound may alter the signal obtained for a positively measured protein/protein interac- 

25 tion in the absence of any compound to be tested. 

The system can be of value in the identification of new genes. For example, membrane 
bound receptors may be identified that interact with a known membrane protein. Proteins 
that interact with oncogene-encoded membrane proteins may be discovered, and these 
30 proteins will be of therapeutic value. 



The system can be used in the design of peptide/small molecule inhibitors. For example, 
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peptides/small molecules that interact with membrane-bound growth factor receptors can 
be identified and then tested in other systems for their ability to inhibit the signal trans- 
duction. Peptides/small molecules that bind to bacterial or viral membrane proteins can 
be identified and then tested in other systems for their ability to inhibit these bacteria or 
5 viruses. 

The system can be used to test affinity reagents for protein purification. Peptides or pro- 
tein domains can be identified that interact with the known membrane protein of interest 
and these may then be used in a purification protocol for the known protein. 

10 

The term M NbM" as used herein refers to the N-terminal portion of yeast ubiquitin, which 
encompasses amino acids 1-37 of yeast ubiquitin. „NbM" contains either the amino acids 
1-37 of wild type ubiquitin, or the amino acids 1-37 of mutated ubiquitin where e.g. the 
amino acid isoleucine at position 13 has been exchanged for any of the following amino 
15 acids: leucine, valine, alanine or glycine, or amino acids 1-37 of ubiquitin where the 
amino acid isoleucine at position 3 has been exchanged for any of the following amino ^ 
acids: leucine, valine, alanine or glycine, or most pr?**#S^y, amino acids 1-37 of ubiquitin % 
where the amino acids isoleucine at position 3 and 13 have been exchanged for any of 
the following amino acids: leucine, valine, alanine or glycine. 

20 

The term "CbM" as used herein refers to the C-terminal portion of yeast ubiquitin, which 
encompasses amino acids 35-76 of yeast ubiquitin or amino acids 35-76 of mutated 
ubiquitin, where e.g. the amino acid lysine at position 48 has been changed to the amino 
acid glycine using standard site-directed mutatgenesis techniques (following, for instance, 

25 directions described in the STRATAGENE Quickchange Mutagenesis Kit, Stratagene, 
CA, USA). The exchange of lysine at position 48 for the inert glycine represents an im- 
portant advantage over other variations of the Split-ubiquitin technique (Dunnwald et al., 
1999, Johnsson & Varshavsky, 1994, Laser et al., 2000, Stagljar et al.. 1998, Wittke et 
al., 1999) because it prevents modification of CbM via the attachment of ubiquitin moie- 

30 ties. The poly-ubiquitination of unmodified CbM may interfere with the binding to NbM and 
may lead to the degradation of the bait polypeptide by enzymes of the ubiquitination 
pathway (Hershko & Ciechanover, 1992). 
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The term "bait" as used in this document defines a fusion of a polypeptide and one or 
more other polypeptides, one of which is a first protein sequence involved in intracellular 
protein degradation such as CbM. The bait can be used in the membrane-based yeast 
5 two-hybrid system (MbY2H system) to investigate interactions between said bait and one 
or several preys. 

The term "bait vector" as used in this document refers to a nucleic acid construct which 
contains sequences encoding "the bait" and regulatory sequences that are necessary for 
10 the transcription and translation of the encoded sequences by the host cell, and prefera- 
bly regulatory sequences that are needed for the propagation of the nucleic acid con- 
struct in yeast and E. coli. Preferably the "bait vector also encodes the activator of the 
host reporter gene(s). 

15 The term "prey as used in this document defines a fusion between a polypeptide and 
one or more other polypepft » one of which is a second protein sequence involved in 
intracellular protein degraoafe such as NbM. NbM may be either wild type NbM (then 
termed NbM) or a mutated version of NbM, where one or several of its amino acids have 
been replaced by other amino acids, as described in detail for "NbM" above. 

20 

The terms "prey vector" and "library vector as used herein refer to a nucleic acid con- 
struct which contains sequences encoding the "prey" and regulatory sequences that are 
necessary for the transcription and translation of the encoded sequences encoding by the 
host cell, and preferably regulatory sequences that are needed for the propagation of the 
25 nucleic acid construct in yeast and E. coli. 



- at: 



The term "Split-ubiquitin" as used herein refers to quasi-native yeast ubiquitin assembled 
30 from noncovalently linked NbM and CbM, which have been brought into close spatial 
proximity by the interaction of two unrelated polypetides that are fused to CbM and NbM, 
respectively. Split-ubiquitin is recognized by ubiquitin-specific proteases present in the 
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yeast cell, which attack the polypeptide chain C-terminal to the double glycine motif in 
CbM. The proteolytic cleavage leads to the breakage of the polypeptide chain after the 
double glycine motif. 

The term "transactivator polypeptide" or "activator as used in this document refers to any 
polypeptide that possesses the ability to activate the "reporter gene" of the host cell, e.g. 
by recruiting and activating the RNA Polymerase II machinery of yeast. Preferably, the 
transactivator polypeptide is the Herpes simplex virus VP16 protein and most preferably, 
it is the acidic B42 domain. 

The term "LexM" as used in this document refers to the nucleic acid sequence encoding 
the bacterial repressor protein LexA or its translation product, where the sequence may 
either encode the wild type LexA sequence or a mutated LexA sequence where the 
amino acid arginine at position 157 has been replaced by a glycine or where the amino 
acid arginine at position 159 has been replaced by glutamate or glycine, or any combina- 
tion ~r:he two mutations. 

The term Gal1-74 as used in this document refers to the nucleic acid sequence encoding 
the yeast protein Gal4, corresponding to amino acids 1 -74 of its translation product. 

The term Gal1-93 as used in this document refers to the nucleic acid sequence encoding 
the yeast protein Gal4, corresponding to amino acids 1-93 of its translation product 

The term "artificial transcription factor" as used in this document refers to a hybrid protein 
consisting of (1) a polypeptide with the intrinsic ability to bind to a defined nucleic acid 
sequence, such as the bacterial repressor protein LexA or the yeast Gal4 protein or the 
Drosophila melanogaster proteins Ubx and abd-A and (2) any transactivator polypeptide, 
as defined above. 

The term "TDA" as used in this document refers to a nucleic acid sequence or its transla- 
tion product, comprising the following elements: (1) an epitope tag which allows the im- 
munological detection of the polypeptide by means of an antibody directed specifically 



WO 03/083136 




T/EP03/03287 



9 



against the epitope, such as HA tag, 3xFLAG tag, 3xMyc tag, (2) an artificial transcription 
factor. 

The term "reporter strain" as used in this document refers to any host that containes at 
least one "reporter gene" such as a yeast strain that contains nucleic acid constructs, ei- 
ther integrated into its genome, or as autonomously replicating elements, that produce a 
signal upon activation by the activator, e.g. by conferring the ability to grow on selective 
medium when activated or leading to cell death or cell cycle arrest when activated or 
leading to production of enzymes such as p-galactosidase. Preferably, the host cell 
comprises more than one reporter gene. 

The term "reporter gene" as used in this document refers to a nucleic acid sequence 
comprising the following elements: (1) a binding site for an artifical transcription factor, (2) 
a minimal promoter sequence, which may be any sequence derived from a yeast pro- 
moter, but which preferably is any of GAL1 promoter, GAL2 promoter, CYC1 promoter, 
SP01 promoter, HIS3 promoter, (3) a nucleic acid sequence encoding a polypepti^ 
which can be selected for or against in yeast Or which has an enzymatic activity tftf&0$t~ 
be measured using an appropriate assay, such as HIS3, ADE2, URA3, FAR1, lacZ, and 
(4) a terminator sequence from a yeast gene, such as CYC1 or ADH1. The nucleic acid 
sequence may be integrated into the genome of a yeast reporter strain or it may supplied 
on an autonomously replicating plasmid. 

The term "FAR1" refers to the nucleic acid sequence encoding the yeast FAR1 gene or to 
its gene product, where the amino acid serine at position 87 has been replaced by the 
amino acid proline in order to stabilize the protein and prevent its degradation by the 
proteasome machinery for yeast. 

The term "5-FOA" as used in this document refers to the compound 5-fluoroorotic acid, 
which may be converted into toxic metabolites by the action of the URA3 gene product, 
leading to cell death. 
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As used herein, "first membrane bound protein or part thereof refers to a protein which is 
bound to or integrated into the membrane of cells such as the type I and type II trans- 
membrane membranes or the so-called "selfactivators" which are soluble proteins having, 
however, been modified by the attachment of e.g a myristylation site, which site attaches 
the soluble "selfactivator" to the cell membrane. In the context of the present invention, 
any "part" of such protein can be used as long as it is capable of being attached to the 
membrane and capable of interacting with the second protein strongly enough so that 
split-ubiquitin is formed. 

As used herein, the "second protein or part thereof refers to a protein which can interact 
with the "first membrane bound protein or part thereof and which protein can be also a 
membrane bound protein or a soluble protein. In the context of the present invention, a 
part of such protein is sufficient as long as it is capable of interacting with the "first mem- 
brane bound protein or part thereof, strong enough that split-ubiquitin is formed. 

"First protein sequence involved in intracellular protein degradation" and "second protein 
sequence involved in intracellular protein degradation" rptti**arts of a protein which, 
when brought together in a host cell, e.g. by the interaction of the first protein to be tested 
and the second protein to be tested, reconstitutes a structure which is capable of activat- 
ing an intracellular protein degradation machinery such as the ubiquitin depending prote- 
ases. 

A method for detecting the interaction between a first membrane protein and a second 
(membrane or soluble) protein is provided in accordance to the present invention which 
method comprises the following steps: 

(a) providing a host cell containing at least one detectable gene (reporter gene) having 
a binding site for a transcriptional activator, such that the detectable gene expresses 
a detectable product, preferably a protein, when the detectable gene is transcrip- 
tionally activated; 
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(b) providing, as part of a bait vector, a first chimeric gene capable of being expressed 
in said host cell, the first chimeric gene coding inter alia for a first membrane protein 
or part thereof which gene is attached to the DNA-sequence of a first module en- 
coding inter alia a first protein sequence involved in intracellular protein degradation 
and a transcriptional activator, said first protein or part thereof to be tested whether it 
can interact with a second protein or part thereof; 



(c) 

10 



30 



providing, as part of a prey vector, a second chimeric gene capable of being ex- 
pressed in said host cell, the second chimeric gene coding inter alia for a second 
protein or part thereof which is either membrane bound or soluble and which gene is 
attached to the DNA sequence of a second module encoding inter alia a second 
protein sequence involved in intracellular protein in degradation; and 



(d) introducing the bait vector and the prey vector into the host cell such that an interac- 
15 tion between the expressed first and second protein and/or their parts can take 

place, which interaction le^feari interaction of the first protein sequence of the ; . % 
first module and the secona" pmiein sequence of the second module which interac- . , 
tion in turn leads to activation of an intracellular protease and proteolytic separation 
of the transcriptional activator, at least one of the bait and prey vectors, preferably 
20 both, being suitable for being maintained episomally. 

The method is set up in yeast, preferably in Schizosaccharomyces pombe, most prefera- 
bly in the budding yeast Saccharomyces cerevisiae, but can be set up as well in bacteria 
such as Escherichia coli and mammalian cell systems. The host cell contains a detect- 
25 able gene having a binding site for a transcriptional activator, preferably TDA, such that 
the detectable gene expresses a detectable protein when the detectable gene is tran- 
scriptionally activated. 



The first chimeric gene is provided which is capable of being expressed in the host cell. 
The first chimeric gene contains a DNA coding for a first membrane protein fused to TDA, 
and a first protein sequence involved in intracellular protein degradation, such as CbM. 
This first protein is then tested for interaction with a second protein or protein fragment. 
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A second chimeric gene is provided which is capable of being expressed in the host cell. 
The second chimeric gene contains a DNA sequence that encodes a second hybrid pro- 
tein. The second hybrid protein contains e.g. an N-terminal domain of Ubiquitin (NbM). 

5 The second hybrid protein also contains a second protein or a protein fragment which is 
to be tested for interaction with the first protein or protein fragment. The second hybrid 
protein may be encoded in a library of plasmids that contain genomic, cDNA or syntheti- 
cally generated DNA sequences fused to the DNA sequence encoding the N-terminal 
domain of Ubiquitin (NbM). The interaction between the first membrane protein and the 

10 second protein in the host cell, therefore, causes the cleavage of the transcriptional acti- 
vator that activates transcription of the reporter gene(s). The method may be carried out 
by introducing the first chimeric gene and the second chimeric gene into a yeast reporter 
strain. The host cell is subjected to conditions under which the first membrane protein 
and the second protein are expressed in sufficient quantity for the reporter gene to be 

15 activated. The cells are then tested for the expression of a reporter gene, which is acti- 
vated t greater degree than in the absence of an interaction between the first protein 
and tha ;^iahd protein. 

In this way interactions between a first membrane protein and a library of proteins can be 
20 tested. For example, the first membrane protein may be derived from a bacterial mem- 
brane protein, a viral membrane protein, an oncogene-encoded membrane protein, a 
growth-factor receptor or any eukaryotic membrane protein. The second protein may be 
derived from a library of plasmids as described above. 

25 The method of the present invention may be practiced using a kit tor detecting interaction 
between a first membrane protein and a second protein comprising the following ele- 
ments: 

(a) a host cell containing at least one detectable gene (reporter gene) having a binding 
30 site for a transcriptional activator, such that the detectable gene expresses a de- 

tectable product, preferably a protein, when the detectable gene is transcriptionally 
activated; 



15 
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(b) a first vector (bait) comprising a first site capable of receiving a first nucleic acid 
coding for a first membrane protein or part thereof such that when the first nucleic 
acid is inserted it becomes attached to the DNA sequence of a first module en- 

5 coding inter alia a first protein sequence involved in intracellular protein degrada- 

tion, the module further comprising a nucleic acid for a transcriptional activator, 

(c) a second vector (prey) comprising a second site capable of receiving a second 
nucleic acid coding for a second membrane protein or a soluble protein or part 

10 thereof such that when the second nucleic acid is inserted it becomes attached to 

the DNA sequence of a second module encoding inter alia a second protein se- 
quence involved in intracellular protein degradation; and optionally 



(d) a plasmid library encoding second proteins or parts thereof. 



The kit may include a container, vectors and a host cell. The vectors for the membrane 
based detection system include the bait vector and prey vector and optionally a plasmM 
library, such vectors are for example shown in Figure 2 as vector (p-Y-CbM-TDA), that 
allows to assay any Type I transmembrane protein (Y), contains a yeast promoter se- 

20 lected from the group consisting of the ADH promoter, CYC1 promoter and TEF pro- 
moter, followed by unique restriction sites for inserting a DNA sequence encoding a 
membrane protein in such a manner that the first protein is expressed as a fusion to the 
CbM-LexM-B42 portion. The first vector also contains a terminator sequence which is 
necessary to terminate the transcription of a given membrane protein. The first vector in 

25 one embodiment does not include a sequence that allows its replication in yeast, but 
preferably contains such a sequence^analternative embodiment, allowing episomal repli- 
cation of the vector. This vector may be an integrative vector that has to be stable inte- 
grated in the yeast genome. Also included on the first vector is a first marker gene (e.g. 
LEU2), the expression of which in the host cell permits selection of cells containing the 

30 first marker gene from cells that do not contain the first marker gene. 



A second vector (e.g. pTDA-CbM-Y), that allows to assay any Type II transmembrane 
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protein (Y), contains a yeast promoter selected from the group consisting of the ADH 
promoter, CYC1 promoter and TEF promoter, followed by unique restriction sites for in- 
serting a DNA sequence encoding a membrane protein in such a manner that the first 
protein is expressed as a fusion to LexM-B42-CbM portion (Figure 2). Note that there is 

5 an inverted orientation of the transcription factor fused to the TDA portion. The second 
vector might also contain a terminator sequence which is necessary to terminate the 
transcription of a given test membrane protein. The second vector, in a first embodiment, 
does not include a sequence that allows its replication in yeast. However, a second em- 
bodiment includes a sequence which allows episomal replication of the vector. As the 

10 first vector, this vector may be an integrative vector that has to be stable integrated in the 
yeast genome. Also included on the second vector is a marker gene (LEU2), the expres- 
sion of which in the host cell permits selection of cells containing the first marker gene 
from cells that do not contain the first marker gene. 

15 A third vector (e.g. pX-HA-NbM) that allows the cloning of the prey protein (X), may be a 
transmembrane protein or soluble (cytoplasmic) protein (Figure 2). The test protein may..., 
be encoded in a library of plasmids that contain genomic, c£ 4,^ or synthetically gener-w 
ated DNA sequences fused to the NbM domain. The third vector also includes a promoter 
preferably selected from the group consisting of the ADH promoter, CYC1 promoter and 

20 TEF promoter, and does include a transcription termination signal to direct transcription. It 
also includes a DNA sequence that encodes the N-terminal domain of Ubiquitin (NbM) 
and a unique restriction site to insert a DNA sequence encoding the second protein or 
protein fragment into the vector. Thus, the third vector allows the cloning of the protein as 
an N-terminal fusion to the NbM domain. The third vector further preferably includes a 

25 means for replicating itself in the host cell, i.e. yeast or bacteria. It also includes a second 
marker gene such as (TRP1), the expression of which in the host cell permits selection of 
cells containing the second marker gene from cells that do not contain the second marker 
gene. 

30 A fourth vector (pNbM-HA-X) allows the cloning of the prey protein (X), a transmembrane 
protein or soluble (cytoplasmic) protein, as a C-terminal fusion to the NbM domain (Figure 
2). The fourth vector also includes a promoter preferably selected from the group con- 
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sisting of the ADH promoter, CYC1 promoter and TEF promoter, and transcription termi- 
nation signal to direct transcription. It also includes a DNA sequence that encodes the N- 
terminal domain of ubiquitin (NbM) and a unique restriction site to insert a DNA sequence 
encoding the second protein or protein fragment into the vector. The fourth vector further 
5 preferably includes a means for replicating itself in the host cell and in bacteria. It also 
includes a second marker gene (such as TRP1), the expression of which in the host cell 
permits selection of cells containing the second marker gene from cells that do not con- 
tain the second marker gene. 

10 While vectors published by Stagljar et al. [Proc. Natl. Acad. Sci. USA 95, 5187-5192 
(1998)] enabled the expression of the yeast Wbplp, Ostlp and Alg5p only, the newly 
designed vectors enable now the cloning of any desirable membrane "bait" protein into 
e.g. pCbM-TDA and pTDA-CbM, and any desirable "prey" protein or a genomic/cDNA 
library into e.g. pNbM-HA-X or pX-HA-NbM vectors. For easy immunological detection 

15 pNbM-HA-X and pX-HA-NbM are tagged with a haemaglutinine(HA) epitope. All vectors 
were confirmed by DNA sequencing. 

The kit includes a host cell, preferably a yeast or bacterial strain that contains a detect- 
able gene having binding sites for the artificial transcription factor such as PLV. The 
20 binding site is positioned so that the reporter gene expresses a reporter protein when two 
proteins (the first protein and the second protein) properly interact in this system. The 
host cell, by itself, is incapable of expressing a protein having a function of the first 
marker gene (LEU2), the second marker gene (TRP1), the CbM-TDA portion, or the NbM 
domain. 

25 

The basic strategy of the testing method is shown in Figure 1 . The method is based on 
the previously developed split-ubiquitin technique. The split-ubiquitin technique is based 
on the ability of NbM and CbM, the N- and C-terminal halves of Ub, to assemble into 
quasi-native Ubiquitin. Ubiquitin-specific proteases (UBPs), which are present in all 
30 eukaryotic cells, recognize the reconstituted Ubiquitin, but not its halves, and cleave the 
Ubiquitin moiety off a reporter protein that had been linked to NbM or CbM, preferably to 
the C terminus of CbM. Quite in analogy to the two-hybrid system, the liberation of the 
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reporter serves as a readout indicating the reconstruction of Ubiquitin. The assay is de- 
signed in a way that prevents efficient association of NbM and CbM by themselves, but 
allows it if the two Ubiquitin halves are separately linked to proteins that interact in vivo. 

5 Figure 3A schematically illustrates two test membrane proteins, Y and X. The first chi- 
meric protein contains the Type I transmembrane bait protein Y fused to the CbM-TDA 
portion resulting in Y-CbM-TDA protein. The second chimeric protein contains a second 
interacting protein X (here depicted as a membrane protein) fused to the NbM domain 
called X-NbM. Neither of these proteins, Y-CbM-TDA and X-NbM, is able to activate tran- 

10 scription. 

Figure 3B. The interaction of proteins Y and X, results in reassociation of NbM and CbM 
to form split-ubiquitin. Split-ubiquitin is recognized and cleaved by the ubiquitin specific 
proteases (UBPs) (open scissors), liberating TDA. The TDA can enter the nucleus by 
15 diffusion and bind to the LexA-binding sites, leading to activation of transcription of the 

lacZ and H/S' reporter genes. This results in blue cells in the presence of X-gal and ^ 
growth of tftw^js on agar plates lacking hlstjdine. 

Figure 3C. Schematical illustration of the interaction between a Type II transmembrane 
20 protein (Y) and a cytoplasmic protein (X). The first chimeric protein contains the bait pro- 
tein Y fused to the TDV-CbM portion resulting in TDA-CbM-Y protein. The second chi- 
meric protein contains a second interacting protein X (here depicted as a cytoplasmic 
protein) fused to the NbM domain called NbM-X. Neither of these proteins, TDA-CbM-Y 
and NbM-X, is able to activate transcription. The interaction of proteins Y and X, as illus- 
25 trated in Figure 3c, results in formation of split-ubiquitin. Split-ubiquitin is recognized and 
cleaved by the ubiquitin specific proteases (UBPs) (open scissors), liberating TDV-CbM. 
The TDV-CbM can enter the nucleus by diffusion and bind to the LexA-binding sites 
leading to activation of transcription of the lacZ and HIS3 reporter genes. This results in 
blue cells in the presence of X-gal and growth of the cells on agar plates lacking histidine. 

30 

Figure 4A. Construction of yeast genomic DNA libraries in pX-HA-NbM and pNbM-HA-X 
vectors. Yeast genomic DNA is isolated from the wild type yeast strain and cleaved by 
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restriction endonuclease Sau3A, or is random sheared by sonication. DNA fragments 
ranging in size from 0.5-3 kb (blue boxes) are selected and cut out from the agarose gel, 
and ligated into pX-HA-NbM and pNbM-HA-X treated with BamHI and calf intestine alka- 
line phosphatase. The ligated DNA is transformed in E. coli and plasmid DNA is isolated 
5 and pooled together. Yeast genomic DNA library prepared in this way can than be trans- 
formed in the yeast reporter strain expressing a bait protein of interest (Y) fused to CbM- 
TDA portion. 

Figure4B. Construction of cDNA libraries in pX-HA-NbM and pNbM-HA-X vectors. Total 
10 mRNA is isolated from a certain cell line, tissue or organism, and is converted into cDNA 
via action of reverse transcriptase. The appropriate linkers are ligated to cDNA mole- 
cules, which are then introduced into a specific unique restriction site located in the MCS 
of pX-HA-NbM and pNbM-HA-X plasmids. 

15 Figure 5. Library transformation by the mating procedure. In this screening approach one 
CbM-TDA-fused bait X is screened against an entire library and positives are selected , 
based on their ability to grow on selection plates. Diploids that have survived selection in 
the library screening are picked up and the library plasmids encoding the interacting prey 
are isolated and sequenced in order to identify the interacting protein. As specified in the 

20 section X, the libraries can be made either from random genomic or cDNA fragments or 
from full-length ORFs that are cloned separately and then pooled. 

Figure 6. Possible outcomes of the MbY2H screen. A. Bait and prey protein do not inter- 
act in yeast. In this case, no reconstitution of NbM and CbM will take place and no split- 

25 ubiquitin will be formed. Since no split-ubiquitin is formed, the transcriptional activator 
polypeptide will not be cleaved off by ubiquitin-specific proteases and the transcriptional 
activator polypeptide is unable to reach the nucleus of the yeast cell. As a consequence, 
the reporter genes of the yeast transformant are not activated and will not produce any 
reporter protein. Therefore, the respective yeast transformant will be unable to grow on 

30 the selective medium. B. Bait and prey do interact. On interaction between bait and prey 
proteins, ubiquitin reconstitution occurs and leads to the proteolytic cleavage and subse- 
quent release of a transcription factor which triggers the activation of a reporter system. 
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Consequently, the respective yeast transformant will be able to grow on the selective 
medium and will appear as isolated yeast colonies after 3-5 days incubation at 30°C. 

Figure 7. General drug screening using the MbY2H technology. In this experimental de- 
5 sign, the HIS3 reporter is used for positive selection, while URA3 is used for counterse- 
lection. Interaction between two membrane proteins X and Yresults in reconstitution of 
quasi native ubiquitin and, consequently, in growth on medium lacking histidine, but le- 
thality on medium containing 5-FOA, a toxic metabolite of the uracil pathway. Following 
the screening with (i) a complex random chemical library, (ii) peptide library encoded by a 
10 plasmid, (iii) a peptide present in medium, (iv) minimal interaction domain expressed from 
the plasmid in yeast, or (v) a single chain antibody selected from the pool of antibodies 
fused to NbM, (i-v are collectively called drugs), drugs are selected that render the yeast 
cells to be ura3- or 5-FOA R and his3- (e.g. they do not grow on selective medium lacking 
uracil, they are resistant to 5_FOA and they do not grown on selective medium lacking 
15 histidine). 

Figure 8. Demonstration of the importance of promoter/origin of re£ifee£i?n choice in the 
MbY2H system. Yeast strain YG673 (bearing the endogenously integrated yeast Wbp1- 
CbM-PLV bait protein) was transformed with the pAlg-5-NbMG prey plasmid under con- 

20 trol of the strong yeast ADH promoter from either the 2u- (high-copy) or CEN/ARS- (low 
copy) plasmid. In the case of ADH promoter/2u ori combination, overexpression of the 
Alg5p resulted in the high expression of the HIS3 and JacZ reporter genes. However, 
when Alg5p was expressed under control of ADH promoter from the low copy number 
vector (CEN/ARS), no activation of the reporter genes was observed. As the positive 

25 control, an interacting Swp1 p was used. 

Figure 9.: Expression of the beta2-adrenergic receptor in the yeast reporter strain L40. 
(A) L40 was transformed with the construct construct pBAR-CbM-FLV and with an empty 
vector carrying the LEU2 resistance marker. Two independent transformants were se- 
30 lected from the transformation plates and grown in selective medium to an OD(600) of 
1.0. Extracts were prepared aaccording to standard procedures, run on 12% SDS-PAGE 
gels, transferred to nitrocellulose and the fusion proteins were detected using a polyclonal 
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rabbit antibody against the VP16 domain. Lanes 1 and 2: extracts from two independent 
clones transformed with the construct pBAR-CbM-TDA show a prominent band at 100 
kDa, the expected size for the beta2 AR-CbM-TDA fusion polypeptide, as well as smaller 
bands of 75, 40 and 35 kDa, possibly representing postranslational modification or pro- 

5 teolytic degradation products. Lanes 2 and 3: L40 transformed with empty control vector. 
No bands are detected by the VP16 antibody. (2) The beta2 AR is localized in the mem- 
brane. L40 cells transformed with pBAR-CbM-TDA were lysed and extracts were sepa- 
rated into soluble and particulate fractions. Lane 1: total extract. Two bands of 100 and 
75 kDa, as well as lower molecular weight bands around 37 kDa are detected by the 

10 VP16 antibody. Lane 2: soluble fraction. No receptor is found in the soluble fraction. Lane 
3: particulate fraction (membranes). The VP16 antibody recoginizes a 100 kDa band rep- 
resenting the beta2 adrenergic receptor. (C) Self-activation test of pBAR-CbM-TDA 
transformed in L40. Yeast transformants were plated on selective medium lacking leucine 
to select for the presence of the constructs and after incuabtion at 30°C for three days, a 

15 beta-galactosidase filter assay was performed. (1) Clone 1 tranformed with pBAR-CbM- 

TDA, lane 2: clone 2 transformed with a " r :-CbM-TDA, lane 3: emtpy vector, lane 4: « 
positive control. No beta-galactosidase --&vfe-<$£sion is found in two independent yeast v i 
colonies transformed with pBAR-CbM-TDA, indicating that the reporter genes of L40 are 
not activated by the expression of beta2 AR-CbM-TDA. 

20 

Figure 10. Detection of protein-protein interactions in the MbY2H system requires high 
level expression of baits and preys. (A) Schematic representation of the NTF: NTF inter- 
action in the MbY2H system. NTF baits are fused C-terminally to the Cub and the LexA- 
VP16 reporter cassette (Rep, LexA contains the R157G mutation to decrease the 

25 strength of the nuclear localization signal), whereas preys are fused N-terminally to NubG 
(NUB). Baits are expressed from a vector containing the CEN/ARS origin of replication, a 
LEU2 gene for selection in yeast and either a CYC1 promoter (low expression level, 
designated pCMBVI) or an ADH1 promoter (high expression level, designated pAMBVI). 
Preys are expressed from a vector containing a 2micron origin of replication, a TRP1 

30 gene for selection in yeast and either a CYC1 promoter (designated pDSL) or an ADH1 
promoter (designated pADSL). (B) Homodimerization of NTF as assayed in the MbY2H 
system. Yeast strain L40 was transformed with the expression vectors as indicated and 
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plated on minimal medium, selecting for cells that contained both plasmids. After 3 days 
growth at 30°C, 10 colonies were picked, resuspended in 0.9% NaCI and aliquots were 
spotted onto selective plates lacking tryptophan, leucine and histidine to select for pro- 
tein-protein interactions. Weakly expressed baits (pCMBV1-NTF) and preys (pDSL- 

5 NubG-NTF) show no growth, whereas a strongly expressed bait (pAMBV1-NTF) in com- 
bination with a weakly expressed prey (pDSL-NubG-NTF) shows only partial growth on 
selective medium. A strongly expressed prey (pADSL-NubG-NTF) shows growth on se- 
lective medium both with a weakly and a strongly expressed bait. Neither weakly nor 
strongly expressed baits interact with a non-cognate control prey expressed at high levels 

10 (pDSL-Alg5-NubG). Note that only the strongly expressed bait is capable of interacting 
with the positive control (pDSL-Alg5-Nubl). This indicates that heterologous transmem- 
brane baits and preys have to be expressed at high levels in order to detect an interaction 
in the MbY2H system. 

15 Figure 11. Interaction between the receptor tyrosine kinase ErbB3 and the adaptor pro- 
tein Nrdpl in JJic' \"bY2H system. MbY2H baits were fused to the Cub-LexA-VP16 re-, 
porter module .^pining the R157G mutation to decrease the strength of the nuclear 
localization signal (designated LexM) and preys were fused to NubG. (a) Yeast coex- 
pressing pAMBN/1 -ErbB3 (the entire coding sequence of rat ErbB3 fused to Cub-LexM- 

20 VP16, driven by a strong ADH1 promoter from a vector containing a CEN/ARS element of 
replication and a LEU2 gene for selection) and pADSL-Nrdpl (amino acids 135 to 317 of 
human Nrdpl fused N-terminally to NubG, driven by a strong ADH1 promoter from a 
vector containing the 2micron origin of replication and a TRP1 gene for selection) were 
spotted onto medium lacking tryptophan and leucine (SD-TL) to select for cells containing 

25 both bait and prey vectors and onto medium lacking tryptophan, leucine and histidine 
(SD-HTL) to select for the interaction between ErbB3 amd Nrdpl. Cells coexpressing 
ErbB3 and Nrdpl grow on SD-HTL. (b) Yeast coexpressing the control bait pMBV1-Alg5 
(the entire coding sequence of yeast Alg5 fused to Cub-LexM-VP16, driven by a CYC1 
promoter from a vector containing a CEN/ARS element of replication and a LEU2 gene 

30 for selection) and pADSL-Nrdpl were spotted onto SD-TL and SD-HTL. Cells coex- 
pressing Alg5 and Nrdpl do not grow on SD-HTL. The actual protein levels of yeast Alg5 
expressed from the CYC1 promoter and ErbB3 expressed from the ADH1 promoter are 
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similar, (c) Yeast coexpressing the bait pAMBV1-ErbB3 and the control prey pADSL-Alg5 
were spotted onto SD-TL and SD-HTL Cells coexpressing ErbB3 and Alg5 do not grow 
on SD-HTL. 

5 Figure 12. Bait dependency test using the beta2-adrenergic receptor expressed at two 
different levels in combination with preys identified in a MbY2H screen. A bait depend- 
ency test was carried out by cotransforming the beta2-adrenergic receptor bait with sev- 
eral prey clones identified in the MbY2H screen described in Example 7. Bait and prey 
clones were cotransformed into the yeast strain and treated as described in Section 8 

10 "Bait dependency test". (A) Beta2-adrenergic receptor expressed from the weak CYC1 
promoter. Only a subset of the prey clones show growth on selective medium. (B) Beta2- 
adrenergic receptor expressed from the strong ADH1 promoter. All prey clones show 
growth on selective medium. None of the preys showed any growth when coexpressed 
with a Alg5 control bait expressed at high levels. Light green bars: growth on SD-HTL. 

15 Dark green bars: growth on selective medium supplemented with 5 mM aminotriazole. 

Figure 13. Principle bait vectors of the MbY2H system. (A) In pCMBVI expression of the 
bait is driven by a weak CYC1 prompter. The sites Xba I and Pst I allow subcloning of a 
cDNA in frame with the following Cub-LexM-VP16 cassette. Transcription is terminated 
20 by an ADH1 terminator (ADHt). The vector backbone contains a kanamycine resistance 
gene for propagation in E. coli (Kan), a CEN/ARS origin of replication (CEN/ARS) and a 
LEU2 gene for propagation in yeast (LEU2). (B) The vector pAMBVI is identical to 
pCMBVI , except for the strong ADH1 promoter which drives expression of bait. 

25 Figure 14. Principle prey vectors of the MbY2H system. (A) In pDSL-Nx, expression of 
the prey is driven by a weak CYC1 promoter. Two unique Sfi I sites allow directional 
cloning of the prey cDNA as a C-terminal fusion to NubG. Transcription is terminated by a 
CYC1 terminator (CYC1t). The backbone of the vector contains an ampicillin resistance 
gene for propagation in E. coli (Amp), a 2micron origin and a TRP1 gene for propagation 

30 in yeast. (B) pADSL-Nx is identical to pDSL-Nx, except for the strong ADH1 promoter 
driving expression of the prey. (C) pDSL-xN is identical to pDSL-Nx except that here, 
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preys are fused N-terminally to NubG. (D) pADSL-Nx is identical to pDSL-Nx, except for 
the strong ADH1 promoter driving expression of the prey. 

The system is dependent on a number of conditions to properly carry out the method of 
this invention. The first interacting („bait") protein must be an integral membrane protein, 
a membrane-associated protein, or it may be artificially attached to the membrane using 
e.g. fatty acid modification. In order to avoid false-positives, the bait protein carrying 
CbM-TDA may either be controlled bya promoter conferring tow level expression or, if a 
promoter conferring high level expression is used, nucleic activity of the TDA may be 
controlled using other mechanisms (e.g. use of a mutated transcription factor such as a 
mutated LexA). We found that a soluble CbM-TDA results in gene activation without the 
need for any NbM. Therefore, the bait fusion protein has to be anchored to the lipid bi- 
layer in order to test for interactions. Soluble proteins of interest might be tested by fusing 
them to a membrane protein anchor. In addition, the CbM-TDA and NbM domains must 
be located in the cytoplasm, otherwise the cleavage of the PLV portion cannot occur 
since UBPs are located only in the cytoplasm of the yeast cell. 

Bait vectors 

Bait vectors are plasmid constructs which preferably contain the following features: 

(1) A selection marker for propagation of the plasmids in E. coli, which preferably is an 
expression cassette encoding the kanamycine resistance gene, but which may also be an 
expression cassette encoding the ampicillin resistance gene or the chloramphenicol re- 
sistance gene. In fact, any other marker that is selectable in E. coli may be used for this 
purpose. 

(2) An origin of replication that allows propagation of the plasmids in E. coli, such as a 
pUC based or pBR322 origin of replication, but preferably an origin of replication that al- 
lows the plasmid to be propagated in E. coli at a high copy number. 
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(3) A selection marker for propagation of the plasmids in S. cerevisae, which preferably is 
an expression cassette encoding the LEU2 gene, but which may also be any other 
marker that is selectable in S. cerevisae. 

5 (4) A CEN/ARS origin of replication which allows propagation of the plasmids in S. cere- 
visae at a low copy number (usually 1-2 copies per cell). The. use of a CEN/ARS origin of 
replication in the MbY2H system is one option to ensure that the bait polypeptide is ex- 
pressed at a very low level. High level expression of the bait polypeptide in yeast may 
lead to the activation of the reporter genes (so-called self-activation, compare for instance 

10 self-activation in the yeast two-hybrid system ((Fields & Song, 1989, Golemis et al., 
1999), (Serebriiskii & Golemis, 2001), Clontech MATCHMAKER System 3 User Manual). 

(5) An expression cassette, preferably contains the elements as follows: 

(1) A promoter that confers low level expression, such as a CYC1 promoter, or a 
1 5 CUP1 promoter which is inducible by the addition of copper to the medium (Angermayr et 

al., 2000, Macreadie et al., 1989), or confers hip/v level expression (e.g. ADH1 or TEF1 r m 
promoter). The use of an inducible promoter n^ &fe following advantages: a) when in- ^ 
duced, it leads to overexpression of the bait polypeptide in yeast, b) when not induced, 
the promoter only produces a very low level of bait polypeptide, which ensures proper 
20 insertion of the bait polypeptide into the membrane of the yeast cells and prevents prob- 
lems of self-activation that are connected to the overexpression of the bait polypeptide. 
The use of an inducible promoter represents an important advantage since it allows veri- 
fication of correct expression and subsequent library screening to be carried out using the 
same bait construct. 

25 

(2) A nucleic acid sequence encoding a leader which may be a signal sequence de- 
rived from a yeast integral membrane protein such as STE2 (Overton & Blumer, 2000) or 
a signal sequence which confers fatty acid modification to the following polypeptide (N- 
MGCTLS AE D KPGG P-C) which is in the same reading frame as the reading frame of the 

30 signal sequence (Angermayr et al., 2000, Wolven et al., 1997). 
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(3) A multiple cloning site containing one or several recognition sites for restriction 
enzymes which allow the insertion of a nucleic acid encoding a protein or a protein frag- 
ment or a polypeptide, and sequences flanking a defined recognition site for a restriction 
endonuclease, preferably the restriction endonucleases Stu I or Eco47 ///, which are used 

5 to insert a nucleic acid sequence encoding a protein or protein fragment or a polypeptide 
established in vivo recombination method as described above. 

(4) A nucleic acid sequence encoding the C-terminal open reading frame of yeast 
ubiqutin (CbM), encompassing amino acids 35-76 (for amino acid numbering of ubiquitin 

10 and exact explanation of ubiquitin fragments, see (Johnsson & Varshavsky, 1994). 

(5) The expression of the bait polypeptide can then be easily verified using standard 
procedures such as detection by immunoblot using a specific antibody directed against 
either e.g. the Gal4, LexM, B42 or VP16 moieties (see below), or preferably, against the 

15 3xFLAG, 3xMYC or HA epitopes that are present in the bait fusion. The vector contains a 
nucleic acid sequence ^coding an epitope tag, which can be either the 3xFLAG epitope 
(Chubet & Brizzard, ;i%, Hernan et a)., 2000), the 3x MYC epitope (Evan et al., 1985), 
or any other sequence encoding a polypeptide that can be detected by any means, such 
as immunoblotting, A nucleic acid sequence encoding another epitope tag, such as the 

20 hemagglutinin epitope tag (Wilson et al., 1984), may also be present. 

(6) A nucleic acid sequence encoding a polypeptide with the ability to specifically bind 
to a defined nucleic acid sequence, which can be the sequence encoding the bacterial 
LexM protein (Fogh et al M 1994), or preferably, the sequence encoding the yeast Gal4 

25 protein (Gardner et al., 1991, Pan & Coleman, 1989), preferably the sequence encoding 
amino acids 1-93 of the yeast Gal4 protein, or most preferably, the sequence encoding 
the yeast Gal4 protein, amino acids 1-74. 

(7) A nucleic acid sequence encoding a tanscriptional activator domain, such as the 
30 Herpes simplex virus protein VP16 (Shen et al., 1996, Wu et al., 1994) or more prefera- 
bly, the acidic domain B42 (Hughes et al., 1996). 
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(8) A CYC1 or ADH1 terminator sequence. 
Description of preferred bait vectors: 

(1) pCbM-TDA is a low copy yeast/E. coli shuttle vector carrying a CYC1 promoter for 
5 low level expression in yeast, followed by a multiple cloning site encoding several recog- 
nition sites for restriction endonucleases, which can be used to insert a nucleic acid en- 
coding the polypeptide of interest. The multiple cloning site is followed by a nucleic acid 
sequence encoding amino acids 35-76 of yeast ubiquitin (CbM), followed by a sequence 
encoding the 3xFLAG epitope, followed by the sequence encoding the bacterial LexM 

10 protein, followed by the sequence encoding the Herpes simplex virus VP16 protein, fol- 
lowed by a CYC1 terminator. The backbone of the plasmid contains the LEU2 gene for 
selection in yeast, the kanamycine resistance cassette for selection in E. coli, the 
CEN/ARS origin of replication for propagation in yeast and the pUC origin of replication 
for propagation in E. coli. This vector is suitable for the low level expression of CbM-fused 

15 polypetides in yeast, due to the combination of a weak CYC1 promoter and a CEN/ARS 
origin of replication, which results in low copy numbers of the plasmid in yeast (usually 1- 
2 copies per cell). 

(2) pCbM-TDA is a low copy yeast/E. coli shuttle vector carrying a CYC1 promoter for 
20 low level expression in yeast, followed by a multiple cloning site encoding several recog- 
nition sites for restriction endonucleases, which can be used to insert a nucleic acid en- 
coding the polypeptide of interest. The multiple cloning site is followed by a nucleic acid 
sequence encoding amino acids 35-76 of yeast ubiquitin (CbM), followed by a sequence 
encoding the 3xMYC epitope, followed by the sequence encoding the bacterial LexM 

25 protein, followed by the sequence encoding the Herpes simplex virus VP16 protein, fol- 
lowed by a CYC1 terminator. The backbone of the plasmid contains the LEU2 gene for 
selection in yeast, the kanamycine resistance cassette for selection in E. coli, the 
CEN/ARS origin of replication for propagation in yeast and the pUC origin of replication 
for propagation in E. coli. This vector is suitable for the low level expression of CbM-fused 

30 polypetides in yeast, due to the combination of a weak CYC1 promoter and a CEN/ARS 
origin of replication, which results in low copy numbers of the plasmid in yeast (usually 1- 
2 copies per cell). 
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(3) pMP-CbM-TDA is a low copy yeast/E. coli shuttle vector carrying a CYC1 pro- 
moter for low level expression in yeast, followed by a nucleic acid sequence encoding the 
N-MGCTLSAEDKPGGP-C signal sequence for fatty acid modification, followed by a mul- 
tiple cloning site encoding several recognition sites for restriction endonucleases, which 
can be used to insert a nucleic acid encoding the polypeptide of interest. The multiple 
cloning site is followed by a nucleic acid sequence encoding amino acids 35-76 of yeast 
ubiquitin (CbM), followed by a sequence encoding the 3xMYC epitope, followed by the 
sequence encoding the bacterial LexM protein, followed by the sequence encoding the 
Herpes simplex virus VP16 protein, followed by a CYC1 terminator. The backbone of the 
plasmid contains the LEU2 gene for selection in yeast, the kanamycine resistance cas- 
sette for selection in E. coli, the CEN/ARS origin of replication for propagation in yeast 
and the pUC origin of replication for propagation in E. coli. This vector is suitable for the 
low level expression of CbM-fused polypetides in yeast, due to the combination of a weak 
CYC1 promoter and a CEN/ARS origin of replication, which results in low copy numbers 
of the plasmid in yeast (usually 1-2 copies per cell). 

(4) pCUP1-CbM-TDA is a low copy yeast/E. coli shuttle vector carrying a CUP1 pro- 
-moterfor inducible expression in yeast, followed by a multiple cloning site encoding sev- 
eral recognition sites for restriction endonucleases, which can be used to insert a nucleic 
acid encoding the polypeptide of interest. The multiple cloning site is followed by a nu- 
cleic acid sequence encoding amino acids 35-76 of yeast ubiquitin (CbM), followed by a 
sequence encoding the 3xFLAG epitope, followed by the sequence encoding the bacte- 
rial LexM protein, followed by the sequence encoding the Herpes simplex virus VP16 
protein, followed by a CYC1 terminator. The backbone of the construct contains the LEU2 
gene for selection in yeast, the kanamycine resistance cassette for selection in E. coli, 
the CEN/ARS origin of replication for propagation in yeast and the pUC origin of replica- 
tion for propagation in E. coli. This vector is suitable for the inducible expression of CbM- 
fused polypeptides in yeast, due to the combination of an inducible CUP1 promoter and a 
CEN/ARS origin of replication, which results in low copy numbers of the construct in 
yeast (usually 1-2 copies per cell). 
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(5) pMP-CbM-TDA is a low copy yeast/E. coli shuttle vector carrying a CYC1 pro- 
moter for low level expression in yeast, followed by a nucleic acid sequence encoding the 
N-MGCTLSAEDKPGGP-C signal sequence for fatty acid modification, followed by a mul- 
tiple cloning site encoding several recognition sites for restriction endonucleases, which 
can be used to insert a nucleic acid encoding the polypeptide of interest. The multiple 
cloning site is followed by a nucleic acid sequence encoding amino acids 35-76 of yeast 
ubiquitin (CbM), followed by a sequence encoding the 3xMYC epitope, followed by the 
sequence encoding the bacterial LexM protein, followed by the sequence encoding the 
Herpes simplex virus VP16 protein, followed by a CYC1 terminator. The backbone of the 
construct contains the LEU2 gene for selection in yeast, the kanamycine resistance cas- 
sette for selection in E. coli, the CEN/ARS origin of replication for propagation in yeast 
and the pUC origin of replication for propagation in E. coli. This vector is suitable for the 
low level expression of CbM-fused polypeptides in yeast, due to the combination of a 
weak CYC1 promoter and a CEN/ARS origin of replication, which results in low copy 
numbers of the construct in yeast (usually 1-2 copies per cell). 

(6) pCUP1-MP-CbM-TDA is a low copy yeas^. ^4! shuttle vector carrying a CUP1 
promoter for inducible expression in yeast, followed by a nucleic acid sequence encoding 
the N-MGCTLSAEDKPGGP-C signal sequence for fatty acid modification, followed by a 
multiple cloning site encoding several recognition sites for restriction endonucleases, 
which can be used to insert a nucleic acid encoding the polypeptide of interest. The mul- 
tiple cloning site is followed by a nucleic acid sequence encoding amino acids 35-76 of 
yeast ubiquitin (CbM), followed by a sequence encoding the 3xMYC epitope, followed by 
the sequence encoding the bacterial LexM protein, followed by the sequence encoding 
the Herpes simplex virus VP16 protein, followed by a CYC1 terminator. The backbone of 
the construct contains the LEU2 gene for selection in yeast, the kanamycine resistance 
cassette for selection in E. coli, the CEN/ARS origin of replication for propagation in yeast 
and the pUC origin of replication for propagation in E. coli. replication for propagation in 
yeast and the pUC origin of replication for propagation in E. coli. This vector is suitable for 
the inducible expression of CbM-fused polypetides in yeast, due to the combination of an 
inducible CUP1 promoter and a CEN/ARS origin of replication, which results in low copy 
numbers of the construct in yeast (usually 1-2 copies per cell). 
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(7) pCUP1-CbM-TDA is a low copy yeast/E. coli shuttle vector carrying a CUP1 pro- 
moter for inducible expression in yeast, followed by a multiple cloning site encoding sev- 
eral recognition sites for restriction endonucleases, which can be used to insert a nucleic 

5 acid encoding the polypeptide of interest. The multiple cloning site is followed by a nu- 
cleic acid sequence encoding amino acids 35-76 of yeast ubiquitin (CbM), followed by a 
sequence encoding the 3xFLAG epitope, followed by the sequence encoding the bacte- 
rial LexM protein, followed by the sequence encoding the Herpes simplex virus VP16 
protein, followed by a CYC1 terminator. The backbone of the plasmid contains the LEU2 

10 gene for selection in yeast, the kanamycine resistance cassette for selection in E. coli, 
the CEN/ARS origin of replication for propagation in yeast and the pUC origin of replica- 
tion for propagation in E. coli. This vector is suitable for the inducible expression of CbM- 
fused polypetides in yeast, due to the combination of an inducible CUP1 promoter and a 
CEN/ARS origin of replication, which results in low copy numbers of the plasmid in yeast 

15 (usually 1-2 copies per cell). 

(8) pGA93B42 is a lon.wpf yeast/E. coli shuttle vector carrying a CYC1 promoter for 
low level expression in yeast, followed by a multiple cloning site encoding several recog- 
nition sites for restriction endonucleases, which can be used to insert a nucleic acid en- 

20 coding the polypeptide of interest. The multiple cloning site is followed by a nucleic acid 
sequence encoding amino acids 35-76 of yeast ubiquitin (CbM), followed by a sequence 
encoding amino acids 1-93 of the yeast Gal4 protein, followed by the sequence encoding 
the acidic domain B42, followed by an ADH1 terminator. The backbone of the plasmid 
contains the LEU2 gene for selection in yeast, the kanamycine resistance cassette for 

25 selection in E. coli, the CEN/ARS origin of replication for propagation in yeast and the 
pUC origin of replication for propagation in E. coli. This vector is suitable for the low level 
expression of CbM-fused polypetides in yeast, due to the combination of a weak CYC1 
promoter and a CEN/ARS origin of replication, which results in low copy numbers of the 
plasmid in yeast (usually 1-2 copies per cell). 

30 

(9) pGA74B42 is a low copy yeast/E. coli shuttle vector carrying a CYC1 promoter for 
low level expression in yeast, followed by a multiple cloning site encoding several recog- 
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nition sites for restriction endonucleases, which can be used to insert a nucleic acid en- 
coding the polypeptide of interest. The multiple cloning site is followed by a nucleic acid 
sequence encoding amino acids 35-76 of yeast ubiquitin (CbM), followed by a sequence 
encoding amino acids 1-74 of the yeast Gal4 protein, followed by the sequence encoding 
5 the acidic domain B42, followed by an ADH1 terminator. The backbone of the plasmid 
contains the LEU2 gene for selection in yeast, the kanamycine resistance cassette for 
selection in E. coli, the CEN/ARS origin of replication for propagation in yeast and the 
pUC origin of replication for propagation in E. coli. This vector is suitable for the low level 
expression of CbM-fused polypetides in yeast, due to the combination of a weak CYC1 
10 promoter and a CEN/ARS origin of replication, which results in low copy numbers of the 
plasmid in yeast (usually 1-2 copies per cell). This vector allows high stringency screens 
thanks to the use of a severely truncated Gal4 protein. Gal4 amino acids 1-74 retains the 
minimal elements necessary for recognition and binding to the GAL promoter, but it lacks 
the elements necessary for dimerization of the Gal4 protein. Therefore, binding of Gal4 
15 amino acids 1-74 to the GAL1 promoter is not cooperative anymore. The non-cooperative 

irsade of binding severely reduces the affinity of Gal4 amino acids 1-74 for the GAL1 
promoter as compared to Gal4 amino acids 1-93. Consequently, higher levels of Gal4 

amino acids 1-74 are needed in the nucleus to activate transcription of the reporter 

genes. This higher level can only be reached by an overall higher level of released Gal4 
20 (amino acids 1-74)-B42. Only a very strong interaction between a bait protein and a prey 

protein is able to release the amounts of Gal4 (amino acids 1-74)-B42 needed to activate 

transcription of the reporter genes. 

(10) pCGA93B42 is a low copy yeast/E. coli shuttle vector carrying a CUP1 promoter 
25 for inducible expression in yeast, followed by a multiple cloning site encoding several 
recognition sites for restriction endonucleases, which can be used to insert a nucleic acid 
encoding the polypeptide of interest. The multiple cloning site is followed by a nucleic 
acid sequence encoding amino acids 35-76 of yeast ubiquitin (CbM), followed by a se- 
quence encoding amino acids 1-93 of the yeast Gal4 protein, followed by the sequence 
30 encoding the acidic domain B42, followed by an ADH1 terminator. The backbone of the 
plasmid contains the LEU2 gene for selection in yeast, the kanamycine resistance cas- 
sette for selection in E. coli, the CEN/ARS origin of replication for propagation in yeast 
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and the pUC origin of replication for propagation in E. coli. This vector is suitable for the 
inducible expression of CbM-fused polypetides in yeast, due to the combination of an 
inducible CUP1 promoter and a CEN/ARS origin of replication, which results in low copy 
numbers of the plasmid in yeast (usually 1-2 copies per cell). 

5 

(11) pCGA74B42 is a low copy yeast/E. coli shuttle vector carrying a CUP1 promoter 
for inducible expression in yeast, followed by a multiple cloning site encoding several 
recognition sites for restriction endonucleases, which can be used to insert a nucleic acid 
encoding the polypeptide of interest. The multiple cloning site is followed by a nucleic 

10 acid sequence encoding amino acids 35-76 of yeast ubiquitin (CbM), followed by a se- 
quence encoding amino acids 1-93 of the yeast Gal4 protein, followed by the sequence 
encoding the acidic domain B42, followed by an ADH1 terminator. The backbone of the 
plasmid contains the LEU2 gene for selection in yeast, the kanamycine resistance cas- 
sette for selection in E. coli, the CEN/ARS origin of replication for propagation in yeast 

15 and the pUC origin of replication for propagation in E. coli. This vector is suitable for the 

inducible expression of CbM-fused polypetides in yeast, due to the combination -r ' an * 
inducible CUP1 promoter and a CEN/ARS origin of replication, which results in fcmmpy. 
numbers of the plasmid in yeast (usually 1-2 copies per cell). This vector allows high 
stringency screens thanks to the use of a severely truncated Gal4 protein. Gal4 amino 

20 acids 1 -74 retains the minimal elements necessary for recognition and binding to the GAL 
promoter, but it lacks the elements necessary for dimerization of tge Gal4 protein. 
Therefore, binding of Gal4 amino acids 1-74 to the GAL1 promoter is not cooperative 
anymore. The non-cooperative mode of binding severely reduces the affinity of Gal4 
amino acids 1-74 for the GAL1 promoter as compared to Gal4 amino acids 1-93. Conse- 

25 quently, higher levels of Gal4 amino acids 1-74 are needed in the nucleus to activate 
transcription of the reporter genes. This higher level can only be reached by an overall 
higher level of released Gal4 (amino acids 1-74)-B42. Only a very strong interaction be- 
tween a bait protein and a prey protein is able to release the amounts of GaI4 (amino ac- 
ids 1-74)-B42 needed to activate transcription of the reporter genes. 

30 

(12) pMGA93B42 is a low copy yeast/E. coli shuttle vector carrying a CYC1 promoter 
for low level expression in yeast, followed by a nucleic acid sequence encoding the N- 
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MGCTLSAEDKPGGP-C signal sequence for fatty acid modification, followed by a multi- 
ple cloning site encoding several recognition sites for restriction endonucleases, which 
can be used to insert a nucleic acid encoding the polypeptide of interest. The multiple 
cloning site is followed by a nucleic acid sequence encoding amino acids 35-76 of yeast 
5 ubiquitin (CbM), followed by a sequence encoding amino acids 1-93 of the yeast Gal4 
protein, followed by the sequence encoding the acidic domain B42, followed by an ADH1 
terminator. The backbone of the plasmid contains the LEU2 gene for selection in yeast, 
the kanamycine resistance cassette for selection in E. coli, the CEN/ARS origin of repli- 
cation for propagation in yeast and the pUC origin of replication for propagation in E. coli. 
10 This vector is suitable for the low level expression of CbM-fused polypetides in yeast, due 
to the combination of a weak CYC1 promoter and a CEN/ARS origin of replication, which 
results in low copy numbers of the plasmid in yeast (usually 1-2 copies per cell). 

(13) pMGA74B42 is a low copy yeast/E. coli shuttle vector carrying a CYC1 promoter 

15 for low level expression in yeast, followed by a nucleic acid sequence encoding the N- 
MGCTLSAEDKPGGP-C signal sequence for fatty acid modification, followed by a multi- 
ple cloning site encoding several recognition sites for- friction endonucleases, which 
can be used to insert a nucleic acid encoding the polypeptide of interest. The multiple 
cloning site is followed by a nucleic acid sequence encoding amino acids 35-76 of yeast 

20 ubiquitin (CbM), followed by a sequence encoding amino acids 1-74 of the yeast Gal4 
protein, followed by the sequence encoding the acidic domain B42, followed by an ADH1 
terminator. The backbone of the plasmid contains the LEU2 gene for selection in yeast, 
the kanamycine resistance cassette for selection in E. coli, the CEN/ARS origin of repli- 
cation for propagation in yeast and the pUC origin of replication for propagation in E. coli. 

25 This vector is suitable for the low level expression of CbM-fused polypetides in yeast, due 
to the combination of a weak CYC1 promoter and a CEN/ARS origin of replication, which 
results in low copy numbers of the plasmid in yeast (usually 1-2 copies per cell). This 
vector allows high stringency screens thanks to the use of a severely truncated Gal4 
protein. Gal4 amino acids 1-74 retains the minimal elements necessary for recognition 

30 and binding to the GAL promoter, but it lacks the elements necessary for dimerization of 
the Gal4 protein. Therefore, binding of Gal4 amino acids 1-74 to the GAL1 promoter is 
not cooperative anymore. The non-cooperative mode of binding severely reduces the 



WO 03/083136 




'CT7EP03/03287 



affinity of GaW amino acids 1-74 for the GAL1 promoter as compared to Gal4 amino ac- 
ids 1-93. Consequently, higher levels of Gal4 amino acids 1-74 are needed in the nucleus 
to activate transcription of the reporter genes. This higher level can only be reached by 
an overall higher level of released GaW (amino acids 1-74)-B42. Only a very strong inter- 
5 action between a bait protein and a prey protein is able to release the amounts of GaW 
(amino acids 1-74)-B42 needed to activate transcription of the reporter genes. 

(14) pCMGA93B42 is a low copy yeast/E. coli shuttle vector carrying a CUP1 promoter 
for inducible expression in yeast, followed by a nucleic acid sequence encoding the N- 

10 MGCTLSAEDKPGGP-C signal sequence for fatty acid modification, followed by a multi- 
ple cloning site encoding several recognition sites for restriction endonucleases, which 
can be used to insert a nucleic acid encoding the polypeptide of interest The multiple 
cloning site is followed by a nucleic acid sequence encoding amino acids 35-76 of yeast 
ubiquitin (CbM), followed by a sequence encoding amino acids 1-93 of the yeast GaW 

15 protein, followed by the sequence encoding the acidic domain B42, followed by an ADH1 

terminator. The backbone of .t^plasmid contains the LEU2 gene for selection in yeast, _ 
the kanamycine resistance ca^tie for selection in E. coli, the CEN/ARS origin of repli- 
cation for propagation in yeast and the pUC origin of replication for propagation in E. coli. 
This vector is suitable for the inducible expression of CbM-fused polypetides in yeast, due 

20 to the combination of an inducible CUP1 promoter and a CEN/ARS origin of replication, 
which results in low copy numbers of the plasmid in yeast (usually 1-2 copies per cell). 

(15) pCMGA74B42 is a low copy yeast/E. coli shuttle vector carrying a CUP1 promoter 
for inducible expression in yeast, followed by a nucleic acid sequence encoding the N- 

25 MGCTLSAEDKPGGP-C signal sequence for fatty acid modification, followed by a multi- 
ple cloning site encoding several recognition sites for restriction endonucleases, which 
can be used to insert a nucleic acid encoding the polypeptide of interest. The multiple 
cloning site is followed by a nucleic acid sequence encoding amino acids 35-76 of yeast 
ubiquitin (CbM), followed by a sequence encoding amino acids 1-74 of the yeast GaW 

30 protein, followed by the sequence encoding the acidic domain B42, followed by an ADH1 
terminator. The backbone of the plasmid contains the LEU2 gene for selection in yeast, 
the kanamycine resistance cassette for selection in E. coli, the CEN/ARS origin of repli- 
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cation for propagation in yeast and the pUC origin of replication for propagation in E. coli. 
This vector is suitable for the inducible expression of CbM-fused polypetides in yeast, due 
to the combination of an inducible CUP1 promoter and a CEN/ARS origin of replication, 
which results in low copy numbers of the plasmid in yeast (usually 1-2 copies per cell). 

5 This vector allows high stringency screens thanks to the use of a severely truncated Gal4 
protein. Gal4 amino acids 1-74 retains the minimal elements necessary for recognition 
and binding to the GAL promoter, but it lacks the elements necessary for dimerization of 
the Gal4 protein. Therefore, binding of Gal4 amino acids 1-74 to the GAL1 promoter is 
not cooperative anymore. The non-cooperative mode of binding severely reduces the 

10 affinity of Gal4 amino acids 1-74 for the GAL1 promoter as compared to Gal4 amino ac- 
ids 1-93. Consequently, higher levels of Gal4 amino acids 1-74 are needed in the nucleus 
to activate transcription of the reporter genes. This higher level can only be reached by 
an overall higher level of released Gal4 (amino acids 1-74)-B42. Only a very strong inter- 
action between a bait protein and a prey protein is able to release the amounts of Gal4 

1 5 (amino acids 1 -74)-B42 needed to activate transcription of the reporter genes. 

(ie; jSMP-CbM-ML-MCS is a low copy yeast/E coli shuttle vector carrying a CYC1 
promoter for low level expression in yeast, followed by a nucleic acid sequence encoding 
the N-MGCTLSAEDKPGGP-C signal sequence for fatty acid modification, followed by a 

20 nucleic acid sequence encoding amino acids 35-76 of yeast ubiquitin (CbM), followed by 
a sequence encoding the 3xMYC epitope, followed by the sequence encoding the bacte- 
rial LexM protein, followed by a multiple cloning site encoding several recognition sites for 
restriction endonucleases, which can be used to insert a nucleic acid encoding the poly- 
peptide of interest, followed by a CYC1 terminator. The backbone of the construct con- 

25 tains the LEU2 gene for selection in yeast, the kanamycine resistance cassette for selec- 
tion in E. coli, the CEN/ARS origin of replication for propagation in yeast and the pUC 
origin of replication for propagation in E. coli. This vector is suitable for the low level ex- 
pression of CbM-fused polypetides in yeast, due to the combination of a weak CYC1 
promoter and a CEN/ARS origin of replication, which results in low copy numbers of the 

30 construct in yeast (usually 1 -2 copies per cell). 
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(17) pCUP1-MP-CbM-ML-MCS is a low copy yeast/E. coli shuttle vector carrying a 
CUP1 promoter for inducible expression in yeast, followed by a nucleic acid sequence 
encoding the N-MGCTLSAEDKPGGP-C signal sequence for fatty acid modification, fol- 
lowed by a nucleic acid sequence encoding amino acids 35-76 of yeast ubiquitin (CbM), 
followed by a sequence encoding the 3xMYC epitope, followed by the sequence encod- 
ing the bacterial LexM protein, followed by a multiple cloning site encoding several rec- 
ognition sites for restriction endonucleases, which can be used to insert a nucleic acid 
encoding the polypeptide of interest, followed by a CYC1 terminator. The backbone of the 
construct contains the LEU2 gene for selection in yeast, the kanamycine resistance cas- 
sette for selection in E. coli, the CEN/ARS origin of replication for propagation in yeast 
and the pUC origin of replication for propagation in E. coli. replication for propagation in 
yeast and the pUC origin of replication for propagation in E. coli. This vector is suitable for 
the inducible expression of CbM-fused polypetides in yeast, due to the combination of an 
inducible CUP1 promoter and a CEN/ARS origin of replication, which results in low copy 
numbers of the construct in yeast (usually 1-2 copies per cell). 

(18) pMGA93-MCS is a low copy yeast/E. coli shuttle vector carrying a CYC1 prom ^v? 
for low level expression in yeast, followed by a nucleic acid sequence encoding the N- 
MGCTLSAEDKPGGP-C signal sequence for fatty acid modification, followed by a nucleic 
acid sequence encoding amino acids 35-76 of yeast ubiquitin (CbM), followed by a se- 
quence encoding amino acids 1-93 of the yeast Gal4 protein, followed by a multiple 
cloning site encoding several recognition sites for restriction endonucleases, which can 
be used to insert a nucleic acid encoding the polypeptide of interest, followed by an 
ADH1 terminator. The backbone of the construct contains the LEU2 gene for selection in 
yeast, the kanamycine resistance cassette for selection in E. coli, the CEN/ARS origin of 
replication for propagation in yeast and the pUC origin of replication for propagation in E. 
coli. This vector is suitable for the low level expression of CbM-fused polypetides in yeast, 
due to the combination of a weak CYC1 promoter and a CEN/ARS origin of replication, 
which results in low copy numbers of the construct in yeast (usually 1-2 copies per cell). 

(19) pMGA74-MCS is a low copy yeast/E. coli shuttle vector carrying a CYC1 promoter 
for low level expression in yeast, followed by a nucleic acid sequence encoding the N- 
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MGCTLSAEDKPGGP-C signal sequence for fatty acid modification, followed by a nucleic 
acid sequence encoding amino acids 35-76 of yeast ubiquitin (CbM), followed by a se- 
quence encoding amino acids 1-74 of the yeast Gal4 protein, followed by a multiple 
cloning site encoding several recognition sites for restriction endonucleases, which can 
be used to insert a nucleic acid encoding the polypeptide of interest, followed by an 
ADH1 terminator. The backbone of the construct contains the LEU2 gene for selection in 
yeast, the kanamycine resistance cassette for selection in E. coli, the CEN/ARS origin of 
replication for propagation in yeast and the pUC origin of replication for propagation in E. 
coli. This vector is suitable for the low level expression of CbM-fused polypetides in yeast, 
10 due to the combination of a weak CYC1 promoter and a CEN/ARS origin of replication, 
which results in low copy numbers of the construct in yeast (usually 1-2 copies per cell). 
This vector allows high stringency screens thanks to the use of a severely truncated Gal4 
protein. Gal4 amino acids 1-74 retains the minimal elements necessary for recognition 
and binding to the GAL promoter, but it lacks the elements necessary for dimerization of 
15 tge Gal4 protein. Therefore, binding of Gal4 amino acids 1-74 to the GAL1 promoter is 
not cooperative anymore. The non-cooperative mode of Wrf&q severely reduces the 
affinity of Gal4 amino acids 1-74 for the GAL1 promoter as spared to Gal4 amino ac- 
ids 1 -93. Consequently, higher levels of Gal4 amino acids 1 -74 are needed in the nucleus 
to activate transcription of the reporter genes. This higher level can only be reached by 
20 an overall higher level of released Gal4 (amino acids 1-74)-bait polypeptide. Only a very 
strong interaction between a bait protein and a prey protein is able to release the 
amounts of Gal4 (amino acids 1-74)-bait polypeptide needed to activate transcription of 
the reporter genes. 

25 (20) pCMGA93-MCS is a low copy yeast/E. coli shuttle vector carrying a CUP1 pro- 
moter for inducible expression in yeast, followed by a nucleic acid sequence encoding the 
N-M GCTLS AEDKPGG P-C signal sequence for fatty acid modification, followed by a nu- 
cleic acid sequence encoding amino acids 35-76 of yeast ubiquitin (CbM), followed by a 
sequence encoding amino acids 1-93 of the yeast Gal4 protein, followed by a multiple 

30 cloning site encoding several recognition sites for restriction endonucleases, which can 
be used to insert a nucleic acid encoding the polypeptide of interest, followed by an 
ADH1 terminator. The backbone of the construct contains the LEU2 gene for selection in 
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yeast, the kanamycine resistance cassette for selection in E. coli, the CEN/ARS origin of 
replication for propagation in yeast and the pUC origin of replication for propagation in E. 
coli. This vector is suitable for the inducible expression of CbM-fused polypetides in 
yeast, due to the combination of an inducible CUP1 promoter and a CEN/ARS origin of 
5 replication, which results in low copy numbers of the construct in yeast (usually 1-2 cop- 
ies per cell). 

(21) pCMGA74-MCS is a low copy yeast/E. coli shuttle vector carrying a CUP1 pro- 
moter for inducible expression in yeast, followed by a nucleic acid sequence encoding the 
10 N -MGCTLS AE DKPGG P-C signal sequence for fatty acid modification, followed by a nu- 
cleic acid sequence encoding amino acids 35-76 of yeast ubiquitin (CbM), followed by a 
sequence encoding amino acids 1-74 of the yeast Gal4 protein, followed by a multiple 
cloning site encoding several recognition sites for restriction endonucleases, which can 
be used to insert a nucleic acid encoding the polypeptide of interest, followed by an 
15 ADH1 terminator. The backbone of the construct contains the LEU2 gene for selection in 
yeast, the kanamycine resistance o*^*tte for selection in E. coli, the CEN/ARS origin of 
replication for propagation in yeas* m«, ihe pUC origin of replication for propagation in E. 
coli. This vector is suitable for the inducible expression of CbM-fused polypetides in 
yeast, due to the combination of an inducible CUP1 promoter and a CEN/ARS origin of 
20 replication, which results in low copy numbers of the construct in yeast (usually 1-2 cop- 
ies per cell). This vector allows high stringency screens thanks to the use of a severely 
truncated Gal4 protein. Gal4 amino acids 1-74 retains the minimal elements necessary 
for recognition and binding to the GAL promoter, but it lacks the elements necessary for 
dimerization of tge Gal4 protein. Therefore, binding of Gal4 amino acids 1-74 to the GAL1 
25 promoter is not cooperative anymore. The non-cooperative mode of binding severely re- 
duces the affinity of Gal4 amino acids 1-74 for the GAL1 promoter as compared to Gal4 
amino acids 1-93. Consequently, higher levels of Gal4 amino acids 1-74 are needed in 
the nucleus to activate transcription of the reporter genes. This higher level can only be 
reached by an overall higher level of released Gal4 (amino acids 1-74)-bait polypeptide. 
30 Only a very strong interaction between a bait protein and a prey protein is able to release 
the amounts of Gal4 (amino acids 1-74)-bait polypeptide needed to activate transcription 
of the reporter genes. 
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(22) pDSdual-1 is a low copy yeast/E. coli shuttle vector carrying two expression cas- 
settes. 

5 Cassette 1 contains a CYC1 promoter for low level expression in yeast, followed by a 
multiple cloning site containing multiple recognition sites for restriction endonucleases 
and sequences for the in vivo cloning described in section 5.2, followed by a nucleic acid 
sequence encoding amino acids 35-76 of yeast ubiquitin (CbM), followed by a sequence 
encoding amino acids 1-74 or 1-93 of the yeast Gal4 protein, followed by an ADH1 termi- 

10 nator. 

Cassette 2 contains a CYC1 promoter for low level expression in yeast, followed by a 
multiple cloning site containing multiple recognition sites for restriction endonucleases 
and sequences for the in vivo cloning described in section 5.2, followed by the sequence 
15 encoding NbM, followed by a CYC1 terminator. 

Alternative, cassette 1 may contain a CUP1 promoter instead of a CYC1 promoter to 
allow the inducible expression of the bait. 

20 Alternatively, cassette 2 may contain a CYC1 promoter for low level expression in yeast, 
followed by the sequence encoding NbM, followed by a multiple cloning site containing 
multiple recognition sites for restriction endonucleases and sequences for the in vivo 
cloning described in section 5.2, followed by a CYC1 terminator. 



25 



The backbone of the construct contains the LEU2 gene for selection in yeast, the kana- 
mycine resistance cassette for selection in E. coli, the CEN/ARS origin of replication for 
propagation in yeast and the pUC origin of replication for propagation in E. coli. 

This vector is suitable for the low level or inducible expression of a CbM-fused polypetide 
30 together with the low level expression of a prey polypeptide fused either C- or N- 
terminally to NbM. 
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NbM library vectors fore v vectors) 

NbM library vectors are plasmid constructs which preferably contain the following fea- 



tures: 



5 

(1) A selection marker for propagation of the plasmids in E. coli, which preferably is an 
expression cassette encoding the ampicillin resistance gene, but which may also be an 
expression cassette encoding the kanamycine resistance gene or the chloramphenicol 
resistance gene. In fact, any marker that is selectable in E. coli may be used for this pur- 

10 pose. 

(2) An origin of replication that allows propagation of the plasmids in E. coli, such as a 
pUC based or pBR322 origin of replication, but preferably an origin of replication that al- 
lows the plasmid to be propagated in E. coli at a high copy number. 

15 

(3) A selection marker for propagation of the plasmids in S. cerevisae, which preferably is 
an expression cassette encoding the TRP1 gene, but which may « be any marker that 
is selectable in S. cerevisae. 

20 (4) An origin of replication that allows propagation of the plasmids in S. cerevisae, such 
as a 2micron based or CEN/ARS based origin of replication, but preferably an origin of 
replication that allows the plasmid to be propagated in S. cerevisae at a high copy num- 
ber, such as the 2micron origin of replication. 

25 (5) An expression cassette, which preferably contains the following features: 

(1) A promoter element conferring low level expression, such as CYC1, or inducible ex- 
pression, such as GAL1 or CUP1, or conferring high level expression, such as ADH1 or 
TEF1 or promoters having a similar promoter strengh as ADH1. Promoters conferring 
high level expression are preferred. 



30 



(2) An open reading frame encoding the N-terminal part of yeast ubiqutin, encompassing 
amino acids 1-37 (for amino acid numbering of ubiquitin and exact explanation of ubiq- 
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uitin fragments, see Johnsson and Varshavsky, 1994) (NbM), either wild type or bearing 
an amino acid exchange at either position 3 or position 1 3 of the published yeast ubiquitin 
sequence or both, where the amino acid that is used to replace the original amino acid 
can either be Leu, Val, Ala or Gly. Preferably, the replacement is made at position 13 with 
the amino acid glycine and at the positions 3 and 13 with the amino acid glycine. The 
NbM may also contain other mutations that are identified using an in vivo selection pro- 
cedure as described below and whose aim it is to reduce or abolish the affinity of NbM for 
CbM. 

(3) The expression of the bait polypeptide can then be easily verified using standard 
procedures such as detection by immunoblot using a specific antibody directed against 
the 3xFLAG, 3xMYC or HA epitopes that are present in the bait construct. The vector 
contains a nucleic acid sequence encoding an epitope tag, which can be either the 
3xFLAG epitope (Chubet & Brizzard, 1996, Hernan et al., 2000), the 3x MYC epitope 
(Evan et al., 1985), or any other sequence encoding a polypeptide that can be detected 
by any means, such as immunoblotting: a nucleic acid sequence encoding another epi- 
tope tag, such as the hemagglutinin episragsptag (Wilson et al., 1984), may also be used. 

(4) A multiple cloning site containing one or several recognition sites for restriction 
enzymes which allow the insertion of nucleic acid encoding a protein or protein fragment 
or a polypeptide and sequences flanking a defined recognition site for a restriction endo- 
nuclease, preferably the restriction endonuclease Sma I, which are used to insert a nu- 
cleic acid sequence encoding a protein or protein fragment or a polypeptide using the in 
vivo recombination method described below. 

(5) A terminator sequence derived from the yeast ADH1 or CYC1 genes which terminates 
transcription in S. cerevisiae. 

Description of preferred library vectors 



(1 ) pNbM-HA-X: High copy yeast/E. coli shuttle vector carrying the ADH1 promoter for 
high level expression in yeast, followed by the sequence encoding NbM. the sequence 
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encoding the hemaglutinin epitope tag, a multiple cloning site and an ADH1 terminator. 
The backbone of the plasmid contains the TRP1 gene for selection in yeast, the ampicillin 
resistance cassette for selection in E. coli, the 2micron origin of replication for propaga- 
tion in yeast and the pUC origin of replication for propagation in E. coli. This vector is 
5 suitable for the high level expression of NbM-fused polypetides in yeast, due to the com- 
bination of a strong ADH1 promoter and a 2micron origin, which results in high copy 
numbers of the plasmid in yeast. 

(2) pX-HA-NbM: High copy yeast/E. coli shuttle vector carrying the ADH1 promoter for 
10 high level expression in yeast, followed by a multiple cloning site, the sequence encoding 
2 sequential hemaglutinin epitope tags, the sequence encoding NbM, and an ADH1 ter- 
minator. The backbone of the plasmid contains the TRP1 gene for selection in yeast, the 
ampicillin resistance cassette for selection in E. coli, the 2micron origin of replication for 
propagation in yeast and the pUC origin of replication for propagation in E. coli. This 
15 vector is suitable for the high level expression of NbM-fused polypeptides in yeast, due to 
the combination of a strong ADH1 promoter and a 2micron origin, which results in high 
copy numbers «lf the plasmid in yeast. 

(3) P 424NbM-X: High copy yeast/E. coli shuttle vector carrying the CYC1 promoter for 
20 low level expression in yeast, followed by the sequence encoding NbM, the sequence 

encoding the hemaglutinin epitope tag, a multiple cloning site and a CYC1 terminator. 
The backbone of the plasmid contains the TRP1 gene for selection in yeast, the ampicillin 
resistance cassette for selection in E. coli, the 2micron origin of replication for propaga- 
tion in yeast and the pUC origin of replication for propagation in E. coli. This vector is 
25 suitable for the low level expression of NbM-fused polypeptides in yeast, due to the com- 
bination of a very weak CYC1 promoter and a 2micron origin, which results in high copy 
numbers of the plasmid in yeast. 

(4) p424X-NbM: High copy yeast/E. coli shuttle vector carrying the CYC1 promoter for 
30 low level expression in yeast, followed by a multiple cloning site, the sequence encoding 

the hemaglutinin epitope tag, the sequence encoding NbM, and a CYC1 terminator. The 
backbone of the plasmid contains the TRP1 gene for selection in yeast, the ampicillin 
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resistance cassette for selection in E. coli, the 2micron origin of replication for propaga- 
tion in yeast and the pUC origin of replication for propagation in E. coli. This vector is 
suitable for the low level expression of NbM-fused polypeptides in yeast, due to the com- 
bination of a very weak CYC1 promoter and a 2micron origin, which results in high copy 
5 numbers of the plasmid in yeast. 

(5) p414NbM-X: Low copy yeast/E. coli shuttle vector carrying the ADH1 promoter for 
high level expression in yeast, followed by the sequence encoding NbM, the sequence 
encoding the hemaglutinin epitope tag, a multiple cloning site and a CYC1 terminator. 

10 The backbone of the plasmid contains the TRP1 gene for selection in yeast, the ampicillin 
resistance cassette for selection in E. coli, the CEN/ARS origin of replication for propaga- 
tion in yeast and the pUC origin of replication for propagation in E. coli. This vector is also 
suitable for the low level expression of NbM-fused polypetides in yeast, due to the combi- 
nation of a strong ADH1 promoter and a CEN/ARS origin, which results in very low copy 

1 5 numbers of the plasmid in yeast (usually 1 -2 copies per cell). 

(6) p414X-NbM: Low copy yeast/E. coli shuttle vector carrying the ADH1 promoter for 
high level expression in yeast, followed by a multiple cloning site, the sequence encoding 
two sequential hemaglutinin epitope tags, the sequence encoding NbM, and a CYC1 ter- 

20 minator. The backbone of the plasmid contains the TRP1 gene for selection in yeast, the 
ampicillin resistance cassette for selection in E. coli, the CEN/ARS origin of replication for 
propagation in yeast and the pUC origin of replication for propagation in E. coli. This 
vector is also suitable for the low level expression of NbM-fused polypeptides in yeast, 
due to the combination of a strong ADH1 promoter and a CEN/ARS origin, which results 

25 in very low copy numbers of the plasmid in yeast (usually 1 -2 copies per cell). 

Library construction 



Libraries can be constructed from nucleic acids such as genomic DNA, random or non 
30 random oligonucleotides or cDNAs. 
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(1) Genomic libraries. Construction of random fragments of genomic DNA is a stan- 
dard procedure and is described for example in (James et al., 1996). Briefly, genomic 
DNA is isolated from an organism of interest and is sheared into fragments of random 
size either by sonication or by other methods, such as digestion with appropriate restric- 

5 tion enzymes or shearing by extrusion through a syringe. Sheared fragments of the ap- 
propriate size are selected and the ends are repaired using standard procedures, such as 
Klenow/T4 DNA polymerase reactions or Mung Bean exonuclease reactions. Following 
this, the repaired fragments are cloned into a NbM-based library vector using standard 
procedures such as described in (Bedbrook & Ausubel, 1976, Sambrook & Russell, 

10 2001). 

(2) cDNA libraries are made using standard procedures, as described in (Gubler, 
1988), or can be made using various commercially available kits, such as those marketed 
by Clontech, Invitrogen, or Promega. In a preferred application double-stranded cDNA 

15 fragments resulting from the use of the kit are directly inserted into an appropriate NbM 
library vector by means of in vivo recombination in yeast, as descritr (Fusco et al., 
1999, Prado & Aguilera, 1994). Nucleic acid sequences that are c^r-, -gisii>entary to se- 
quences in the expression cassette of the NbM vector are added to each end of the 
cDNA fragments, as described in the user manual of the MATCHMAKER System 3 Li- 

20 brary Construction Kit (Clontech, Palo Alto, California, USA). Simultaneous introduction of 
cDNA fragments and the NbM library vector in appropriate amounts using a standard 
transformation procedure, as described for example (Gietz & Woods, 2001), results in 
homologous recombination between identical sequences at the ends of the cDNA frag- 
ments and results in the generation of NbM library vectors each containing a particular 

25 cDNA insert. Yeast transformants resulting from this procedure are collected using a 
standard procedure, such as described in the MATCHMAKER System 3 User Manual or 
the MATCHMAKER System 3 Library Construction Kit (Clontech) and can be stored at - 
80°C for later screening by mating. In a different procedure, the cDNA fragments and the 
NbM library vector are cotransformed not in an untransformed yeast strain but into a 

30 yeast strain which has already been transformed with a plasmid encoding the bait to be 
used in a library screening. Following the identical transformation procedure as described 
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previously, the yeast transformants are directly plated on selective medium and a screen 
is carried out as described in detail below. 

(3) Random or non-random oligonucleotide libraries. Random or non-random oligonu- 
5 cleotides are synthesized according to standard procedures. They can be inserted into 
the NbM library vector using either one or several restriction endonucleases or by the in 
vivo cloning method described above. 

Library screening procedure 

10 

The polypeptide to be used as bait may be inserted into both a bait vector conferring low 
level expression, and a bait vector conferring high level expression, these vectors being 
used in a parallel procedure. In order to avoid false-positives, if a promoter conferring 
high level expression is used for the bait protein carrying CbM-TDA, the nucleic activity of 
15 the TDA may be controlled using other mechanisms (e.g. use of a mutated transcription 

factor such as a mutated LexA). The prey z ^f, preferably carries a strong promoter, . ^ 
such as ADH1 or TEFL 

1 . Definition of appropriate baits 

20 

In principle, any polypeptide that can be immobilized at a membrane within the yeast cell, 
such as the membranes of the endoplasmatic reticulum, the golgi apparatus, the outer or 
inner mitochondrial membrane or, most preferably, the plasma membrane, is suitable as 
a bait in the MbY2H system. Polypeptides that are to be used as baits can be integral 
25 membrane proteins, or they can be soluble proteins that are attached to a membrane via 
fusion to a signal sequence which encodes a membrane anchor, such as a signal se- 
quence followed by a transmembrane domain, or a signal sequence for any fatty acid 
modification, such as myristoylation, palmytoylation, or farnesylation. The different types 
of polypeptides to be used as baits are described in detail below. 

30 



2. Bait construction 
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The nucleic acid sequence encoding the polypeptide to be used as bait can be inserted 
into one of the bait vectors described above using standard molecular biology techniques. 
These techniques are described in detail in (Bedbrook & Ausubel, 1976, Sambrook & 
Russell, 2001). However, the preferable method of inserting the nucleic acid sequence 
5 into a bait vector is by in vivo recombination in yeast (Oldenburg et al., 1997, Prado & 
Aguilera, 1994). The nucleic acid sequence encoding the polypeptide of interest is ampli- 
fied from an appropriate template (such as genomic DNA, cDNA or preferably, plasmid 
DNA) using the polymerase chain reaction (PCR, (Mullis, 1990)). Primers are designed to 
contain a sequence that is complementary to the extreme 5' and 3' ends of the nucleic 

10 acid sequence of interest Furthermore, each primer contains an additional 26 nucleo- 
tides of sequence at the 5' end which is identical to the 26 nucleotides of sequence on 
each side of a recognition site of a restriction endonuclease in one of the bait vectors de- 
scribed above. Thus, the PCR product contains the entire nucleic acid fragment encoding 
the polypeptide to be used as bait, as well as 26 nucleotides of complementary sequence 

15 (designated the Japs") at the 5' and 3' end. The bait vector is treated by incubation with 
the appropriate restr^tion endonuclease as described in standard protocols (Bedbrook & 
Ausubel, 1976, Sam&ooK & Russell, 2001). The PCR fragment and the restricted bait 
vector are mixed in equimolar amounts and transformed into an appropriate yeast re- 
porter strain using standard yeast transformation techniques (Gietz & Woods, 2001). 

20 Thanks to its endogenous repair machinery (Friedberg, 1991a, Friedberg, 1991b), each 
end of the PCR fragment is recombined with its homologous counterpart in the bait vec- 
tor, resulting in a nucleic acid construct, where the PCR product is inserted exactly at the 
open ends of the bait vector (Figure ???). Because this process is absolutely specific in 
yeast, the PCR fragment is inserted in a defined orientation and no nucleotides at the end 

25 are either omitted or added. For this reason, insertion of a nucleic acid sequence encod- 
ing a bait polypeptide via this method is simpler and faster than the conventional insertion 
methods mentioned above. The insertion of the nucleic acid encoding the bait polypep- 
tide must be made in a way so that the reading frame of the bait polypeptide is the same 
as that of the reporter cassette following the multiple cloning site. In this way, translation 

30 of the bait plasmid in yeast provides a continuous polypeptide that consists of the bait 
polypeptide fused to CbM and the elements described above. 
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3. Bait verification 

Prior to screening, it is necessary to test whether the bait is expressed correctly in yeast. 
Correct expression means that the bait polypeptide is translated in its full length in yeast, 
5 that no significant proteolysis occurs and that the bait is inserted into the membranes in 
yeast First, the bait construct is transformed into an appropriate yeast strain, which can 
be any yeast strain, but which preferably is one of the strains described above and most 
preferably, one of the yeast strains DSY1 or DSY2. The yeast transformant is grown un- 
der appropriate selective conditions and expression of the bait is detected using the im- 

10 munoblotting technique. The detection of polypeptides expressed in yeast by the immu- 
nobloting technique is standard and is described for example in (Bedbrook & Ausubel, 
1976, Sambrook & Russell, 2001) and in the YEAST PROTOCOLS HANDBOOK (Clon- 
tech). The appearance of a signal of the expected molecular weight on the autoradio- 
graph signifies that the bait polypeptide is expressed in yeast and that it is not signifi- 

15 cantly degraded by endogenous proteases. 

Testing whether the bait polypeptide is inserted into the membranes in yeast is.a crucial 
step since it gurantees that the bait is immobilized at the membrane and cannot diffuse 
into the nucleus. The immobilization of the bait at the membrane is an absolute prerequi- 

20 site for a MbY2h screen. First, the bait construct is transformed into an appropriate yeast 
strain, which can be any yeast strain, but which preferably is one of the strains described 
above and most preferably, either of the yeast strains DSY1 or DSY2. The yeast trans- 
formant is grown under appropriate selective conditions and the insertion of the bait into 
the membrane is verified by detecting its presence in the particulate fraction following 

25 lysis of the yeast transformants and separation of extracts into soluble and particulate 
fractions, as described for example in (Fuentes et al., 2000). The appearance of a signal 
of the expected molecular weight in the particulate fraction, but not in the soluble fraction, 
on the autoradiograph signifies that the bait polypeptide is expressed and inserted into 
the membranes in yeast. 

30 

A second method to verify the insertion of the bait polypeptide into the yeast membrane is 
to carry out immunofluorescence on the yeast transformants. First, the bait construct is 
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transformed into an appropriate yeast strain, which can be any yeast strain, but which 
preferably is one of the strains described above, and most preferably, either of the yeast 
strains DSY1 or DSY2. The yeast transformant is grown under appropriate selective con- 
ditions and processed for immunofluorescence using any of the standard methods avail- 
5 able, as described for instance in (Burke et al., 2000). Following the immunofluorescent 
staining, the cells are observed under a fluorescence microscope. The presence of a fluo- 
rescent signal at the membranes of yeast signifies that the bait polypeptide is expressed 
and inserted into the membranes in yeast. 

10 4. Self-activation test 

A self-activation test is carried out to ensure that the bait polypeptide, when expressed in 
the absence of any prey polypeptide, does not activate the reporter genes present in the 
reporter strain. First, the bait construct is transformed into an appropriate yeast strain, 

15 which can be any yeast strain, but which preferably is either of the strains described 
above and most preferably, one of the yeast strains DSY1 or DSY2. The v^; ^ transfer- ^ 
mant is plated on different selective media to assess ths activation of ai: mt^urter genes - 
present in the reporter strain. For example, when using the reporter strain AH 109, yeast 
transformants carrying a bait construct and a control construct to supply the TRP1 gene 

20 are plated on selective plates lacking leucine (to select for the presence of the bait con- 
struct), tryptophan (to select for the presence of the control construct) and adenine (to 
assess the activity of the ADE2). Growth of yeast transformants on the selective plates 
signifies that the bait polypeptide alone is able to activate the reporter genes. Such a bait 
cannot be used for a MbY2H screen. 

25 

5. Conventional library transformation 

In the conventional screening technique, the yeast reporter strain is first transformed with 
the bait plasmid as described above (resulting in a yeast strain carrying multiple copies of 
30 the bait plasmid, designated the „bait-bearing strain"). The yeast strain may be any S. 
cerevisiae strain bearing appropriate reporter constructs, either as autonomously repli- 
cating plasmids or as nucleic acid fragments that are integrated into the genome, but 
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preferably, it is the yeast strain L40 [Vojtek, 1993 #398] or the yeast strain AH 109 (Clon- 
tech) or the yeast strain PJ69-4A (James et a)., 1996) or most preferably, it is either the 
yeast strains DSY1 or DSY2. The bait-bearing strain is subsequently transformed with a 
library of NbM-fused nucleic acids, which can be either genomic fragments, random or 
non-random oligonucleotides or preferably, cDNA fragments. Such a library may be con- 
structed in any of the NbM library vectors described above, using any of the methods de- 
scribed herein. 

6. Library transformation by the mating procedure 

Using the mating procedure, the bait plasmid is introduced into a yeast reporter strain of a 
defined mating type (either a or alpha) using any of the methods described above. Then, 
a library, which can be any library described above, is introduced into a strain of the op- 
posite mating type (either alpha or a) using any of the methods described above. Subse- 
quently, the bait-bearing and library-bearing strains are mated according to standard 
methods (Figure 2; (Soellick & Unrig, 2001), MATCHMAKER System 3 User Manual, 
Clontech). Yeast strains used for the transforrcs&M otthe bait are L40, AH109, PJ69-4A 
or preferably, DSY1 or DSY2. Yeast strains used to transform the library are L40alpha, 
Y187, PJ69-4alpha or preferably, DSYM. 

7. Screening procedure 

Following transformation or mating, the yeast transformants are plated on solid selection 
medium. This selection medium is composed in a way as to select for the presence of (1) 
the bait plasmid, (2) the library plasmid and (3) any potential interaction occurring be- 
tween the bait and prey polypeptides. 

The type and formulation of the selection medium to be used depends on the reporter 
strain that is used. 

When using L40 (see below), selective medium lacking (to select for the presence of the 
bait plasmid), tryptophan (to select for the presence of the prey plasmid), and histidine (to 
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select for the occurrence of a protein-protein interaction) is used. 

When using AH109 (see below), selective medium lacking the amino acids leucine (to 
select for the presence of the bait plasmid), tryptophan (to select for the presence of the 
5 prey plasmid), and histidine and adenine (to select for the occurrence of a protein-protein 
interaction) is used. As described herein, it is possible to modulate the stringency of the 
screen using the following modifications in the selective medium: (1) low stringency: se- 
lective medium lacking the amino acids leucine, tryptophan and histidine. (2) medium 
stringency: selective medium lacking the amino acids leucine, tryptophan and adenine. 
10 (3) high stringency: selective medium lacking the amino acids leucine, tryptophan, histid- 
ine and adenine. 

When using PJ69-4A (see below), selective medium lacking leucine (to select for the 
presence of the bait plasmid), tryptophan (to select for the presence of the prey plasmid), 

15 and histidine and adenine (to select for the ocurrence of a protein-protein interaction) is 
used. As described her^i ?, it is possible to modulate the stringency of the screen using 
the following modificazfpm in the selective medium: (1) low stringency: selective medium 
lacking leucine, tryptophan and histidine. (2) medium stringency: selective medium lack- 
ing the amino acids leucine, tryptophan and adenine. (3) high stringency: selective me- 

20 dium lacking leucine, tryptophan, histidine and adenine. 

When using DSY1 (see below), selective medium lacking leucine (to select for the pres- 
ence of the bait plasmid), tryptophan (to select for the presence of the prey plasmid), his- 
tidine and adenine (to select for the occurrence of a protein-protein interaction) and con- 

25 taining the compound 5'-fluoroorotic acid (5-FOA, to counterselect for unspecific protein- 
protein interactions) is used. As described herein, it is possible to modulate the stringency 
of the screen using the following modifications in the selective medium: (1) low strin- 
gency: selective medium lacking leucine, tryptophan and histidine and containing the 
compound 5-FOA. (2) medium stringency: selective medium lacking, tryptophan and 

30 adenine and containing the compound 5-FOA. (3) high stringency: selective medium 
lacking leucine, tryptophan, histidine and adenine and containing the compound 5-FOA. 
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When using DSY2 (see below), selective medium lacking leucine (to select for the pres- 
ence of the bait plasmid), tryptophan (to select for the presence of the prey plasmid), and 
histidine and adenine (to select for the ocurrence of a protein-protein interaction) is used. 
As described herein, it is possible to modulate the stringency of the screen using the fol- 
5 lowing modifications in the selective medium: (1) low stringency: selective medium lack- 
ing leucine, tryptophan and histidine. (2) medium stringency: selective medium lacking, 
tryptophan and adenine. (3) high stringency: selective medium lacking leucine, trypto- 
phan, histidine and adenine. 

10 Using any of the strains mentioned above, an independent test for the protein-protein 
interaction taking place in the particular yeast tranformant can be made by assaying the 
activity of the lacZ reporter. Each of the strains described above carries one or several 
copies of a lacZ reporter gene integrated into its genome. The lacZ reporter is activated in 
the same fashion as the other reporters and, when activated, leads to the transcription 

15 and translation of the beta-galactosidase polypeptide from E. colL The presence of Beta- 
c;3lactosidase can be verified by its ability to convert various substrates into colored com- tt 
pounds. The assays used to detect beta-galactosidase have been described numerous 
times ((Golemis et al., 1999), MATCHMAKER System 3 User Manual, Clontech). Any 
method to detect beta-galactosidase is suitable for use in the MbY2H system. 

20 

For each yeast transformant that was initially plated on the selection medium, two differ- 
ent outcomes are possible: 

(1) The bait and prey polypeptides encoded by the bait and library plasmid, respec- 
25 tively, do not interact in yeast. In this case, the bait and prey polypeptides will not be spa- 
tially close, no reconstitution of NbM and CbM will take place and no split-ubiquitin will be 
formed. Since no split-ubiquitin is formed, the transcriptional activator polypeptide will not 
be cleaved off by ubiquitin-specific proteases and the transcriptional activator polypeptide 
is unable to reach the nucleus of the yeast cell. As a consequence, the reporter genes of 
30 the yeast transformant are not activated and will not produce any reporter protein (Figure 
6A). Therefore, the respective yeast transformant will be unable to grow on the selective 
medium. 
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(2) The bait and prey polypeptides encoded by the bait and library plasmid, respec- 
tively, do interact in yeast. In this case, the bait and prey polypeptides will be spatially 
close, reconstitution of NbM and CbM will take place and split-ubiquitin will be formed. 
5 Since split-ubiquitin is formed, the transcriptional activator polypeptide will be cleaved off 
by ubiquitin-specific proteases and the transcriptional activator polypeptide will diffuse to 
the nucleus of the yeast cell. As a consequence, the reporter gene(s) of the yeast trans- 
formant is (are) activated and will produce the reporter protein(s) (Figure 6B). Therefore, 
the respective yeast transformant will be able to grow on the selective medium and will 
10 appear as isolated yeast colonies after 3-5 days incubation at 30°C. 

A third outcome is possible when using either of the strains DSY1 or DSY2: The bait and 
prey polypeptides encoded by the bait and library plasmid, respectively, do interact in 
yeast, but the prey polypeptide also interacts with a control bait polypeptide that is tran- 

15 scribed as a CbM-LexM-B42 fusion from a nucleic acid sequence integrated into the ge- 
nome of the yeast reporter strain. In this case, the control bait and prey polypu^#is will : 
be spatially close, reconstitution of NbM and CbM will take place and split-ui?^^will be , >v 
formed. Since split-ubiquitin is formed, the LexM-B42 polypeptide will be cleaved off by 
ubiquitin-specific proteases and will diffuse to the nucleus of the yeast cell. As a conse- 

20 quence, the URA3 or FAR1 reporter genes of the yeast transformant are activated and 
will produce the Ura3 or Far1 protein, respectively. Ura3 protein converts the compound 
5-FOA into a metabolite that is toxic to the yeast cell, whereas Far! protein interferes di- 
rectly with the cell cycle machinery and arrests the yeast transformant. In both cases, the 
particular yeast transformant will be unable to grow on selective medium. 

25 

8. Bait dependency test 

Yeast transformants that have survived the selection procedures described in section 7 
most likely carry a library construct encoding a polypeptide that is able to interact with the 
30 bait polypeptide in the yeast reporter strain. However, as with any in vivo system, there is 
a remote chance of isolating so-called „false positives" which are library constructs en- 
coding particular prey polypeptides that confer upon the yeast transformant the ability to 
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survive the selection procedures described in section 7, but that nevertheless do not in- 
teract with the bait polypeptide. A so-called bait dependency test is carried out to isolate 
these false positives from the screen. 

5 First, nucleic acids are isolated from yeast transformants using any of the standard meth- 
ods available (see, for instance, MATCHMAKER System 3 User Manual, Clontech). The 
nucleic acid mixture contains (1) the bait construct, (2) the prey construct and (3) geno- 
mic DNA from the yeast reporter strain. In order to selectively isolate the prey construct, 
the nucleic acid mixture is transformed into an appropriate E. coli strain, which can be 

10 any E. coli strain suitable for cloning purposes, but which preferably is any of the strains 
DH5alpha, DH10B, JM109, TOP10, XL-10 GOLD, SURE and which most preferably is 
the strain XL1-BLUE. The transformants are plated on standard bacterial culture plates 
supplemented with the antibiotic whose corresponding resistance gene is encoded on the 
prey construct. This can be any antibiotic but preferably is the antibiotic ampicillin. Since 

15 E. coli that have been transformed with either the bait construct or with fragments of 
yeast genomic DNA cannot grow on selective plates ^:^*he antibiotic ampicillin, all colo- ^ r 
nies arising on those plates are transformants carrv*^ fee prey construct. Large amounts ; > v > 
of the prey construct are then isolated using any of the standard techniques for plasmid 
isolation from E. coli (Bedbrook & Ausubel, 1976, Sambrook & Russell, 2001); Plasmid 

20 Isolation Kit, Macherey Nagel AG, DQren, Germany, Plasmid Isolation Kit, Quiagen Inc., 
Madison, Wl, USA). 

The isolated library constructs are cotransformed into an appropriate yeast reporter 
strain, which can be AH109, Y190, L40, PJ69-4A, DSY1 or DSY2 but which preferably is 
25 either of Y187 or L40, together with (1) the bait construct and (2) a control bait construct 
encoding an unrelated bait polypeptide, which can be any polypeptide, but which prefera- 
bly is any of Ost1, Wbp1, Alg5, ErbB3 and which most preferably is Alg5. The yeast 
transformants are plated on appropriate selective plates as described in section 7 and 
colonies that have survived selection are assayed for the activity of the lacZ reporter. 

30 



The following outcomes are possible: 
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(1) The prey polypeptide interacts with the original bait polypeptide and the control poly- 
peptide. In this case, cotransformants of bait construct and prey construct, as well as co- 
transformants of control bait construct and prey construct will activate the lacZ reporter. 
Such prey constructs encode likely false positives that unspecifically interact with any bait 

5 polypeptide expressed in the same yeast cell and are sorted out. 

(2) The prey polypeptide interacts with the original bait polypeptide but not with the con- 
trol polypeptide. In this case, cotransformants of bait construct and prey construct, but not 
cotransformants of control bait construct and prey construct will activate the lacZ reporter. 

10 Such prey constructs encode true positives that interact only with the bait polypeptide. 

9. Sequencing of library inserts 

Any library constructs that are bait-dependent i.e. they interact with the original bait poly- 
15 peptide but not with the control polypeptide) most likely encode a polypeptide that inter- 
acts with the bait polypepttf-*.;? in yeast. To identify the polypeptide, the nucleic acid se- 
quence in the library confers which encodes the prey polypeptide is determined using 
standard DNA sequencing methodology. The determined nucleic acid sequence can then 
be used to predict the corresponding polypeptide it encodes, and the nucleic acid se- 
20 quence or the derived amino acid sequence can be used to search any public database 
holding sequence information, such as GenBank or the EMBL database, using any algo- 
rithm for the comparison of nucleic acid or amino acid sequences (such as BLAST, 
(Altschul etal., 1997). 

25 The sequence of the polypeptide that is identified in this way may encode a protein that is 
already known to interact with the bait or it may encode a novel interactor. The most im- 
portant advantage of the MbY2H system as compared to other methods for the detection 
of protein-protein interactions involving membrane-associated proteins is that it delivers 
not only the identity of the interacting protein but also all or part of the nucleic acid se- 

30 quence encoding it. The nucleic acid sequence can be used by the investigator in numer- 
ous ways to conduct further experiments in order to verify and characterize the interaction 
between the proteins found in the MbY2H screen. 
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Meaningful results of the screen 

Simultaneous expression of a particular bait polypeptide and a particular prey polypeptide 
in a yeast cell yields a transformant that survives selection by specifically activating the 
reporter genes by means of a reconstitution of split-ubiquitin from CbM and NbM fused to 
the bait and prey, respectively. Selection Is carried out on selective media and confirmed 
by a colorimetric assay (section 7 above). The ability to reconstitute split-ubiquitin implies 
close spatial proximity of the bait and prey polypeptides and therefore, it is likely (de- 
pending on the stringency used in the screen) that the two polypeptides used as bait and 
prey interact in yeast. If the two polypeptides interact in yeast, this is a strong indication 
that they also interact in vivo in other settings, such as in a mammalian cell. Thus, the 
MbY2H system can be used to identify pairs of interacting proteins by using a predeter-. 
mined protein (the bait) and a large collection (library) of other proteins (the preys). The 
MbY2H provides a means of identifying selectively those proteins that interact with the 

In another application of the system, the stringency threshold is set so low that also very 
weak and especially transient interactions are detected by the same mechanism. This 
implies transient spatial proximity between the bait polypeptide and the prey polypeptide. 
Using such a low stringency screen the MbY2H system can serve as a method to deter- 
mine all proteins that at one point are spatially very close to the bait polypeptide. 

Reporter strains 
1. L40 

The yeast reporter strain L40 has the following genotype: MATa his3O200 trp1-901 Ieu2- 
3112 ade2 LYS2::(4lexAop-HIS3) URA3::(8lexAop-lacZ) GAL4. The construction of L40 
is described in detail in (Vojtek et al., 1993). L40 can be used to select for interacting bait 
and prey polypeptides using the MbY2H system as described in section 5. To select 
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yeast transformants expressing interacting bait and prey polypeptides, the HIS3 reporter 
gene and the lacZ reporter gene are used. 

2. AH 109 

5 

The yeast reporter strain AH109 has the following genotype: MATa, trp1-901, leu2-3, 
112, ura3-52, his3-200, gal4U, gal80n, LYS2 : : GAL1(UAS)-GAL1(TATA)-HIS3, MEL1 
GAL2(UAS)-GAL2(TA TA)-ADE2, URA3::MEL1(UAS)-MEL1(TATA)-lacZ. The construc- 
tion of AH109 is described in detail in the MATCHMAKER System 3 User Manual (Clon- 

10 tech). AH109 can be used to select for interacting bait and prey polypeptides using the 
MbY2H system as described in section 5. To select yeast transformants expressing inter- 
acting bait and prey polypeptides, the HIS3 marker, the ADE2 marker, the MEL1 marker 
and the /acZ marker are used. As compared to L40, the advantage of AH 109 lies in the 
option to vary the stringency of the screen by using the following modifications in the se- 

15 lective medium: (1) low stringency: selective medium lacking the amino acids leucine, 

tryptophan and histidine. (2) medium stringency: selective medium lacking leucine, rv/p- < 
tophan and adenine. (3) high stringency: selective medium lacking, tryptophan, 1 
and adenine. 

20 3. PJ69-4A 

The yeast reporter strain PJ69-4A has the following genotype: MATa trp1-901 leu2-3, 112 
ura3-52 his3-200 gal4Q ga!80u LYS2::GAL1-HIS3 GAL2-ADE2 met2::GAL7-lacZ. The 
construction of PJ69-4A is described in detail in (James et al., 1996). PJ69-4A can be 

25 used to select for interacting bait and prey polypeptides using the MbY2H system as de- 
scribed above. To select yeast transformants expressing interacting bait and prey poly- 
peptides, the HIS3 marker, the ADE2 marker and the lacZ marker are used. As compared 
to L40, the advantage of PJ69-4A lies in the option to vary the stringency of the screen by 
using the following modifications in the selective medium: (1) low stringency: selective 

30 medium lacking leucine, tryptophan and histidine. (2) medium stringency: selective me- 
dium lacking leucine, tryptophan and adenine. (3) high stringency: selective medium 
lacking leucine, tryptophan, histidine and adenine. 
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4. DSY1 

The yeast reporter strain DSY1 contains the following reporter cassettes integrated into 

5 the genome: GAL1 promoter followed by HIS3 gene, GAL2 promoter followed by ADE2 
gene, GAL1 promoter followed by lacZ gene. DSY1 is constructed as follows: The yeast 
strain used for construction of DSY1 is DGY63, which has a high transformation effi- 
ciency. Reporter constructs encoding GAL1-HIS3, GAL2-ADE2 and GAL1-lacZ are sub- 
cloned into the integrative plasmid pFA6a(URA3) (Gietz & Sugino, 1988) using standard 

10 methods and are integrated into the DGY63 genome by multiple rounds of URA3 selec- 
tion and 5-FOA-mediated pop-out, as described in (James et at., 1996). The negative 
reporter construct containing a hybrid promoter consisting of four LexA operator sites 
and a CYC1 upstream activating sequence, followed by the URA3 gene and the CYC1 
terminator. The expression cassette containing the minimal CYC1 promoter, followed by 

15 the nucleic acid sequence encoding a control bait polypeptide fused to LexM-B42-CYC1 

terminator is subcloned into the integrative plasmid pFA6a(KanMX) using standard yeast j 
methodology (Burke et al., 2000) and is integrated into the -genome using G41 8-mediated 
selection as described in (Burke et al., 2000). Alternatively, a variant of DSY1 can be 
constructed by integration of the negative reporter into the genome of AH 109 or PJ69-4A 

20 as described above. 

To select yeast transformants expressing interacting bait and prey polypeptides, the HIS3 
marker, the ADE2 marker and the lacZ marker are used. As compared to L40, the ad- 
vantage of DSY1 lies in the option to vary the stringency of the screen by using the fol- 
25 lowing modifications in the selective medium: (1) low stringency: selective medium lack- 
ing leucine, tryptophan and histidine and containing the compound 5-FOA. (2) medium 
stringency: selective medium lacking, tryptophan and adenine and containing the com- 
pound 5-FOA. (3) high stringency: selective medium lacking, tryptophan, histidine and 
adenine and containing the compound 5-FOA. 



30 
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As compared to AH 109 and PJ69-4A, the advantage of DSY1 lies in the option to carry 
out a bait-dependency test simultaneously during the screening procedure, as described 
above. 

5 5. DSY2 

DSY2 is constructed in a similar fashion as DSY1 , except that the integrated cassette 
contains 4xLexA operator sites-CYC1-UAS-FAR1-CYC1 terminator and CYC1 promoter- 
nucleic acid sequence encoding control bait polypeptide-LexM-B42-CYC1 terminator on 
10 the integrative plasmid pFA6a(kanMX). 

To select yeast transformants expressing interacting bait and prey polypeptides, the HIS3 
marker, the ADE2 marker and the lacZ marker are used. As compared to L40, the ad- 
vantage of DSY2 lies in the option to vary the stringency of the screen by using the fol- 
15 lowing modifications in the selective medium: (1) low stringency: selective medium lack- 
ing the amino acids leucine, tn;n*ophan and histidine. (2) medium stringency: selective 
medium lacking the amino ac*4r< leucine, tryptophan and adenine. (3) high stringency: 
selective medium lacking the amino acids leucine, tryptophan, histidine and adenine. 

20 As compared to AH 109 and PJ69-4A, the advantage of DSY2 lies in the option to carry 
out a bait-dependency test simultaneously during the screening procedure, as described 
above. 

As compared to DSY1 , the advantage of DSY2 lies in the fact that the selective medium 
25 does not have to be supplemented with the compound 5-FOA. 

6. L40alpha 

L40alpha is a derivative strain of L40, which has been switched to the opposite mating 
30 type (from a to alpha) using standard procedures (Burke et al., 2000). It is used together 
with L40 in the mating variation of the screening procedure described above. 
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7. Y187 

The yeast reporter strain Y187 has the following genotype: MATalpha ura3-52 his3-200 
ade2-101 trp1-901 Ieu2-3,112 gal4u met- gal80n URA3::GAL1(UAS)-GAL1(TATA)-lacZ 
5 MEL1. Its construction is described in detail in (Harper et al., 1993). It is used together 
with AH 109 in the mating variation of the screening procedure described above. 

8. PJ69-4alpha 

10 PJ69-4alpha is a derivative strain of PJ69-4A, which has been switched to the opposite 
mating type (from a to alpha) using standard procedures (Burke et al., 2000). It is used 
together with PJ69-4A in the mating variation of the screening procedure described 
above. 

15 9. DSYM 

DS^fW-'Ms derivative strain of DGY63, which has been switched to the opposite mating 
type (from a to alpha) using standard procedures (Burke et al., 2000). It is used together 
with DSY1 or DSY2 in the mating variation of the screening procedure described above. 

20 

10. DSYDS1 

Constructed from DGY63 by knocking out the yeast PDR1-3 genes (which encode tran- 
scription factors regulating the expression of Erg transporters in yeast) using standard 
25 methodology (Burke et al., 2000). A reporter cassette containing GAL1-URA3 and GAL1- 
lacZ is integrated into the URA3 locus using standard methodology (Burke et al., 2000). 

11. DSYDS2 



30 



Constructed from DGY63 by knocking out the yeast PDR1-3 genes (which encode tran- 
scription factors regulating the expression of Erg transporters in yeast) using standard 
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methodology (Burke et al., 2000). A reporter cassette containing GAL1-FAR1 and GAL1- 
lacZ is integrated into the URA3 locus using standard methodology (Burke et al., 2000). 

Real-time reporter element to detect NbM-CbM association to form split-ubiquitin 

5 

1 . A reporter consisting of a firefly luciferase-CbM-GFP (FCG) fusion can be used to 
detect protein-protein interactions in real time. Activation of luciferase leads to fluo- 
rescent energy transfer (FRET) across the CbM to GFP, which becomes activated 
and emits green light. If a bait protein fused to FCG and a prey protein fused to 

10 NbM are coexpressed in yeast, the interaction of the two proteins results in the re- 

constitution of split-ubiquitin and the cleavage of the CbM-GFP boundary. GFP is 
liberated and diffuses away from the luciferase-CbM fusion. Consequently, no 
FRET between luciferase and GFP takes place. If a compound is introduced that 
blocks the interaction between bait and prey protein, GFP is not cleaved off any- 

15 more and FRET between luciferase and GFP will result in the emission of green 

light. Thus, the effectiveness of a compound in blocking the interaction betwessi • 
two proteins can be measured directly by the intensity of GFP fluorescence. 

2. Reporter strain DSYDS3 

20 

DSYDS3 is derived from strain DSYDS2 by integrating a reporter cassette con- 
taining CYC 1 -luciferase-CbM-GFP into the LYS2 locus using standard methodol- 
ogy (Burke et al., 2000). 

25 Applications of the MbY2H system 

Screening for small compounds that disrupt a defined protein-protein interaction 



30 



The aim of a small compound screen is to identify small compounds that can block a 
protein-protein interaction between a bait polypeptide and a prey polypeptide. The term 
..small compound" is meant to signify any small molecule with a molecular weight up to 5 
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kDa, which can be chemically synthesized, extracted or enriched from natural products, 
or which can be polysaccharide compounds. 

A small compound library is any collection of the abovementioned small compounds, 
5 which can be a combinatorial library, a library of defined chemically synthesized com- 
pounds, a library of random chemical compounds, a library of natural compounds, a li- 
brary of natural extracts. The number of compounds in this library may be smaller than 
100 compounds, 100-1000 compounds, 1000-10000 compounds, or more than 10000 
compounds. 

10 

Usually, such a library is supplied in an arrayed format, e.g. in multiwell microtiter plates, 
but it may also be supplied as pools of compounds or as a compound mixture. The iden- 
tity of each compound in each well may be known or it may be unknown. There may be 
one or several compounds in each well. 

The screening procedure is carried out as follows: a report- V^i-n, which may be any 
appropriate yeast strain, but which preferably is one of the ?r#i%described in the sec- 
tion ^Reporter strains", is transformed with a bait construct and a prey construct using any 
of the standard procedures described above. A bait construct contains a nucleic acid se- 

20 quence encoding a bait polypeptide that is inserted into any of the bait vectors described 
in the section „bait vectors". A prey construct contains a nucleic acid sequence encoding 
a prey polypeptide inserted into any of the prey vectors described in the section „prey and 
library vectors". A prerequisite for the screening is that the bait and prey polypeptides 
interact in the reporter strain in a way such that the reassociation of CbM and NbM into 

25 split-ubiquitin takes place and that consequently, the reporter gene(s) is/are activated as 
described avbove. Transformants expressing bait and prey polypeptides are arrayed in 
96-weII or 384-well microtiter plates in selective medium. 

When using the reporter strain DSYDS1, selective medium is any minimal medium lack- 
30 ing leucine (to select for the presence of the bait plasmids), tryptophan (to select for the 
presence of the prey construct). The transformants are grown for 1-5 cell divisions in the 
selective medium. Compounds from any of the compound libraries described above are 
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added to the wells, together with the appropriate amount of 5-FOA (to select against the 
protein-protein interaction). Yeast transformants are grown and wells which contain viable 
yeast transformants are identified by measuring the optical density of the wells using 
standard procedures (Burke et al., 2000). 

5 

Two outcomes are possible: 

(1) The added compound does not inhibit the interaction between the bait and the prey 
polypeptide. In this case, the bait and prey polypeptides will be spatially close, reconstitu- 
tion of NbM and CbM will take place and split-ubiquitin will be formed. Since split-ubiquitin 

10 is formed, the transcriptional activator polypeptide will be cleaved off by ubiquitin-specific 
proteases and the transcriptional activator polypeptide will diffuse to the nucleus of the 
yeast cell. As a consequence, the URA3 reporter of the yeast transformant is activated 
and will produce the URA3 gene product. The URA3 gene product converts 5-FOA into a 
toxic metabolite that is used by the yeast cell and ultimately leads to cell death. No 

15 growth takes place in the well containing the particular yeast transformant and a de- 
crease in the optical density is meas&tftf. 

(2) The added compound does inhibit the interaction between the bait and the prey poly- 
peptide. In this case, the bait and prey polypeptides will not be spatially close, reconstitu- 

20 tion of NbM and CbM will not take place and split-ubiquitin will not be formed. Since split- 
ubiquitin is not formed, the transcriptional activator polypeptide will not be cleaved off by 
ubiquitin-specific proteases and the transcriptional activator polypeptide will remain at- 
tached to the bait polypeptide at the membrane. As a consequence, the URA3 reporter of 
the yeast transformant is not activated and no production of the URA3 gene product 

25 takes place. As the compound 5-FOA is not converted to its toxic metabolite, and be- 
cause 5-FOA on its own has no negative effect on the yeast cell, the yeast transformant 
will continue to grow in the selective media. The growth of the particular yeast transfor- 
mant is measured as an increase in optical density in the well. 

30 When using the reporter strain DSYDS2, selective medium is any minimal medium lack- 
ing leucine (to select for the presence of the bait plasmids), tryptophan (to select for the 
presence of the prey construct). The transformants are grown for 1-5 cell divisions in the 
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selective medium. Compounds from any of the compound libraries described above are 
added to the wells. Yeast transformants are grown and wells which contain viable yeast 
transformants are identified by measuring the optical density of the wells using standard 
procedures (Burke et al., 2000). 

Two outcomes are possible: 

(1) The added compound does not inhibit the interaction between the bait and the prey 
polypeptide. In this case, the bait and prey polypeptides will be spatially close, reconstitu- 
tion of NbM and CbM will take place and split-ubiquitin will be formed. Since split-ubiquitin 
is formed, the transcriptional activator polypeptide will be cleaved off by ubiquitin-specific 
proteases and the transcriptional activator polypeptide will diffuse to the nucleus of the 
yeast cell. As a consequence, the FAR1 reporter of the yeast transformant is activated 
and will produce the FAR1 gene product. The FAR1 gene product will interfere with pro- 
teins involved in the cell cycle of yeast and will lead to cell cycle arrest. No growth takes 
place in the well containing the particular yeast transformant and a no change in the opti- 
cal density h measured. 

(2) The added compound does inhibit the interaction between the bait and the prey poly- 
peptide. In this case, the bait and prey polypeptides will not be spatially close, reconstitu- 
tion of NbM and CbM will not take place and split-ubiquitin will not be formed. Since split- 
ubiquitin is not formed, the transcriptional activator polypeptide will not be cleaved off by 
ubiquitin-specific proteases and the transcriptional activator polypeptide will remain at- 
tached to the bait polypeptide at the membrane. As a consequence, the FAR1 reporter of 
the yeast transformant is not activated and no production of the FAR1 gene product takes 
place. Consequently, the yeast transformant continues to grow. The growth of the par- 
ticular yeast transformant is measured as an increase in optical density in the well. 

Screening for peptides that disrupt a defined protein-p rotein interaction 



Peptide aptamers are small proteins selected from combinatorial libraries that can bind to 
and modify the enzymatic activity of a certain protein, and thus represent a useful way for 
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manipulating protein function in vivo (Fields & Sternglanz, 1994, Yang et al., 1995). Such 
combinatorial peptide libraries were so far generated on the surface of filamentous phage 
or were synthesized chemically and coupled to a earner matrix such as resin beads. 

5 Peptide aptamers are produced from expression iibraries. A peptide aptmamer library is 
constructed by inserting nucleic acid fragements encoding defined or random peptides of 
variable lengths (which can be peptide fragements of 4-16 amino acids, or peptide frag- 
ments longer than 16 amino acids) into an appropriate expression vector, such as 
P426GAL (Mumberg et al, 1995) using standard procedures (Sambrook & Russell, 

10 2001). 

Peptide aptamers may be secreted from the yeast cell or may remain in the cytoplasm of 
the yeast cell. If the localization of the peptides is to be cytoplasmatic, no modifications 
are necessary. If the peptides are to be secreted, they are engineered to contain an N- 
15 terminal secretion signal, such as the leader sequence of the yeast invertase (SUC2) 
polypeptide. 

A nucleic acid sequence encoding the bait polypeptide and a nucleic acid sequence en- 
coding the prey polypeptide are inserted into the screening vector pDSdual-1 described in 
20 the section "bait vectors" using standard procedures (Sambrook & Russell, 2001) or the 
in vivo cloning procedure described above. 

A bait/prey construct, which is the construct pDSdual-1 expressing a bait and a prey 
polypeptide fused to CbM and NbM, respectively, which interact in yeast, and a peptide 
25 aptamer library are cotransformed into a yeast reporter strain using any of the methods 
described above. Preferably, the bait/prey construct and the peptide aptamer library are 
combined in the yeast reporter strain using the mating method described above. The 
yeast reporter strain can be any appropriate yeast strain but preferably is the strain 
DSYDS1 or the yeast strain DSYDS2. 
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When using DSYDS1, transformants are plated on selective medium lacking leucine (to 
select for pDSdual-1) and tryptophan (to select for the library construct encoding the pep- 
tide) and containing the compound 5-FOA. 

5 When using DSYDS2, transformants are plated on selective medium lacking leucine (to 
select for pDSdual-1) and tryptophan (to select for the library construct encoding the pep- 
tide). 

Outcome of the screen: 

10 (1) The peptide does not disrupt the binding of the bait polypeptide to the prey polypep- 
tide. In this case, the bait and prey polypeptides will be spatially close, reconstitution of 
NbM and CbM will take place and split-ubiquitin will be formed. Since split-ubiquitin is 
formed, the transcriptional activator polypeptide will be cleaved off by ubiquitin-specific 
proteases and the transcriptional activator polypeptide will diffuse to the nucleus of the 

15 yeast cell. As a consequence, the URA3 (DSYDS1) or the FAR1 (DSYDS2) reporter of 
the yeast transformant is activated and will produce the Ura3 or Far* $ene product. The 
Ura3 gene product converts 5-FOA into a toxic metabolite that is w^f-by the yeast cell 
and ultimately leads to cell death, whereas the Far1 gene product interferes with the cell 
cyle and arrests the yeast transformants. Consequently, the transformants do not grow 

20 on the selection plates. 

(2) The peptide does inhibit the interaction between the bait and the prey polypeptide. In 
this case, the bait and prey polypeptides will not be spatially close, reconstitution of NbM 
and CbM will not take place and split-ubiquitin will not be formed. Since split-ubiquitin is 

25 not formed, the transcriptional activator polypeptide will not be cleaved off by ubiquitin- 
specific proteases and the transcriptional activator polypeptide will remain attached to the 
bait polypeptide at the membrane. As a consequence, the URA3 (DSYDS1) or FAR1 
(DSYDS2) reporter of the yeast transformant is not activated and no production of the 
Ura3 or Far1 gene product takes place. Consequently, the yeast transformant continues 

30 to grow. Growth of particular transformants results in visible yeast colonies on the selec- 
tion plates. 



15 



20 



WO 03/083136 ^^CT/EP03/03287 

64 



Screening for scFvs that disrupt a defined protein-protein interaction 

Single-chain antibodies (scFvs), synthesized by the ceil and targeted to a particular cel- 
5 lular compartment, can be used to interfere in a highly specific manner with cell growth 
and metabolism. Like peptides, they may be secreted or may be intracellular (so-called 
intrabodies) (Richardson & Marasco, 1995). 

scFvs are produced from expression libraries. A scFv library is constructed by inserting 
10 nucleic acid fragements encoding a defined scFv framework and hypervariable regions 
into an appropriate expression vector, such as p426GAL (Mumberg et al., 1995) using 
standard procedures (Sambrook & Russell, 2001). The construction and use of scFv li- 
braries has been described numerous times, for example in connection with their use in 
phage display (Auf der Maur et al., 2001 , Boder & Wittrup, 1997, Gram et al., 1998). 



The screening procedure for scFvs is ider.tk;eVtb that described for peptides. t:f , 



Minimal interaction domain mapping 



This approach involves the identification of a minimal interaction domain (MID) of a bait or 
a prey protein, which is capable of blocking the interaction between said bait or prey pro- 
tein. First, a bait protein is expressed as fusion to CbM and a prey protein is expressed 
as fusion to NbM. Then, MID libraries of the bait and prey proteins are constructed by 

25 fragmentation of the cDNAs encoding those proteins into fragments of variable size and 
subcloning those fragments into a library vector. The library vector may be chosen so as 
to express the MIDs in the cytoplasm of the cell, or it may contain a secretion signal that 
is used to direct the export of the MIDs into the periplasmic space. Then, a screening ap- 
proach as described for peptide libraries may be used to identify those MIDs that block 

30 the interaction between the bait and the prey protein. The corresponding cDNA is then 
isolated and analysed further. 
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MID libraries are made by fragmentation of nucleic acids encoding (1) the bait polypep- 
tide (the sequence may be supplied as a PGR product amplified from any nucleic acid 
construct encoding the bait polypeptide or as the nucleic acid construct itself) and (2) the 
prey polypeptide (the sequence may be supplied as a PCR product amplified from any 
5 nucleic acid construct encoding the bait polypeptide or as the nucleic acid construct it- 
self). For each bait and prey polypeptide, a separate MID library is constructed. Alterna- 
tively, the libraries may be mixed or a mixed library consisting of bait and prey fragments 
may be made. The average size of the nucleic acid fragments in the library may be cho- 
sen to suit the experimental purposes. Fragments may range from 50 nucleotides up to 
10 5000 nucleotides but preferably, they are in the size range of 100-500 nucleotides or 500- 
1000 nucleotides. Fragments may be made by any appropriate method, but preferably, 
they are made by random shearing through sonification. After sonification, fragnments 
are repaired using standard procedures (Sambrook & Russell, 2001) and are inserted 
into an appropriate expression vector, such as p426GAL (Mumberg et al M 1995). 

15 

MID libaries nrvsiy b©- made to contain secreted MIDs or they may express the MIDs in the ^ 
cytoplasm. If WtDs are to be secreted, they are engineered to contain an N-terminal 
secretion signal, such as the leader sequence of the yeast invertase (SUC2) polypeptide. 

20 The nucleic acid sequences encoding the bait and prey polypeptides are cloned into the 
construct pDSdual-1, using any of the methods described. 

The screening procedure for MIDs is identical to that described for peptides. 

25 

Screening for antibody epitopes against a defined antigen 

scFVs may also be used to identify novel antibodies that bind to a given protein. In this 
case, scFv libraries are constructed where the scFvs are fused to a flexible polypeptide 
30 linker of 10-20 amino acids, followed by a transmembrane domain, followed by NbM. The 
actual construction of the scFv library follows standard methods. 
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Introduction of a library plasmid encoding a scFv fusion results in the expression of an 
integral membrane protein, where the scFv portion is located on the lumenal side of the 
membrane and the NbM is located on the cytoplasmatic side. 

5 A bait construct is made by inserting a nucleic acid fragment encoding the polypeptide to 
be used as antigen (e.g. the polypeptide against which the final antibody is directed) into 
any of the bait constructs using standard methods (Sambrook & Russell, 2001). 

The bait construct and the library are cotransformed into a suitable reporter strain using 
10 any of the methods described in the section ..library screening procedure". Preferably, bait 
and library are coexpressed using the mating procedure descibed in the secction ..library 
screening procedure". Any suitable reporter strain may be used, but preferably, DSY1 or 
DSY2 is used. 

15 The screening procedure follows that described for a conventional MbY2H screen out- 
lined in the section ..library screening procedure". 

Orphan G protein-coupled receptor screening 

20 

G protein coupled receptors (GPCRs) represent the single most important class of 
therapeutical targets today. From the complete human genome sequence, it is estimated 
that there are several hundred different G protein coupled receptors expressed in human 
tissues, the majority of which are uncharacterized to date. So-called orphan receptors are 
25 GPCRs for which no endogenous ligand has been identified. There is great pharmaceuti- 
cal interest in orphan receptors because they represent potentially novel therapeutic tar- 
gets (Wilson & Bergsma, 2000). 

Unactivated receptor (e.g. receptor that is not bound to an agonist or a ligand) is com- 
30 plexed with heterotrimeric G proteins which bind to the cytoplasmatic domains of the re- 
ceptor. Binding of an agonistic compound to a GPCR results in a conformational change 
of the receptor, which leads to the dissociation of the G alpha subunit from the receptor 
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and from the G beta and G gamma subunits. Both G alpha and G beta may then activate 
several signalling components that mediate downstream signalling (Gudermann et al. a 

1997) . 

5 Screening for ligands that bind to orphan receptors is an important field in pharmaceutical 
research. Today, most orphan receptor screens are carried out either in vitro, using par- 
tially purified membrane fractions containing the receptor under investigation, or in vivo, 
using mammalian cell lines that overexpress the GPCR in question. Both approaches 
have drawbacks: in vitro screening does not adequately represent the in vivo situation 

10 and is cumbersome and expensive due to the large amounts of partially purified receptor 
that have to be prepared. In contrast, in vivo screenings put the receptor in its natural 
setting, but the presence of other endogenous receptors with the identical or similar 
specificities may influence the screening results due to cross-talk between the receptors 
and their signalling cascades. Yeast represents an interesting alternative to the above- 

15 mentioned approaches. Screening is carried out in a ccellular setting, thus preserving 
most physiological parameters, but cross-talk between receptors is at iv^M since yeasty 
only expressed two GPCRs and only 3 G proteins, as composed to te* rkpdreds of dif- 
ferent GPCRs and dozens of G proteins that are expressed in mammalian cells. 

20 GPCRs have been sucessfully expressed in yeast and it most cases, the pharmacologi- 
cal fingerprint of the original receptor is preserved (Pausch, 1997, Reilander & Weiss, 

1998) . Although the glycosylation pattern in yeast is not identical to that of mammalian 
cells, it has been shown glycosylation is often not necessary for the specific binding of 
ligands by GPCRs (King et al., 1990). 

25 

In order to adapt the MbY2H system for the identification of ligands for orphan receptors, 
the following steps are necessary. 

Bait construction 

30 

A nucleic acid sequence encoding the GPCR under investigation is inserted into any of 
the bait constructs described using any of the standard methods described in the section 
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Jibrary screening". Preferentially, the nucleic acid sequence enccoding the bait receptor 
is inserted into pCMGA74B42. The signal sequence directing the correct insertion of the 
receptor into the membrane may be the original signal sequence of the receptor or pref- 
erentially, the original signal sequence of the receptor is replaced by a signal sequence 
5 derived from a yeast polypeptide such as the yeast GPCR Ste2p or yeast invertase 
(SUC2). 

Verification of correct expression 

10 Following transformation into an appropriate reporter strain, which may be any reporter 
strain listed in the section reporter strains" but which preferentially is DSY1 or DSY2 and 
most preferentially, DSYDS1 or DSYDS2, the receptor is expressed as a fusion to CbM 
and a reporter moiety, which may any of the reporter moieties described in the section 
„bait vectors" but which preferably is Gal1-93-B42 and most preferably, Gal1-74-B42. 

15 Verification that the receptor is correctly expressed and integrated into the membranes 
may be carried out using any of the methods 'bribed in the section Jibrary screening". ; 

Prey construction 

20 For the purpose of an orphan receptor ligand screen, the prey is defined as any protein 
that interacts with the receptor under investigation in a manner such that the interaction is 
abolished upon ligand binding. Preferentially, the prey is any G alpha subunit or any G 
beta subunit of a heterotrimeric G protein complex and most preferentially, it is a promis- 
cous G alpha subunit such as human G alpha 16 or murine G alpha 13. The nucleic acid 

25 sequence encoding the G protein may be inserted into any of the library or prey con- 
structs described in the section „prey and library vectors" but preferentially, it is inserted 
into p424CUP1-NbM-x or p424CUP1-x-NbM. The constructs p424CUP1-NbM-x or 
p424CUP1-x-NbM are identical to p424NbM-x and p424x-NbM, respectively, except that 
the CYC1 promoter has been replaced by a CUP1 promoter. The G protein may either be 

30 fused N-terminally or C-terminally to NbM. The correct expression of the prey may be 
verified using any of the standard methods described in the section Jibrary screening". 
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Screening 

Screening is initiated by cotransforming bait and prey constructs into any of the reporter 
strains described. Following transformation, the yeast transformants are expanded in se- 

5 lective medium lacking the amino acids leucine (to select for the presence of the bait 
construct) and tryptophan (to select for the presence of the prey construct). Following 
expansion, liquid yeast cultures are aliquoted into 96-well or 384-well plates. Expression 
of the bait and prey polypeptides are induced by the addition of copper to the medium, 
and directly afterwards, a particular compound from a compound library is added to each 

10 well. When using DSYDS1, the compound 5-FOA is added together with the copper to 
the medium. The compound library may be any of the compound libraries described 
above or it may be any compound library that has been manufactured expressively for 
the purpose of orphan receptor ligand screening. Aliquoting the yeast strain, addition of 
copper and addition of compounds is done in a automated fashion using standard high 

15 throughput screening methodology. 

The following outacmm are possible: • 

(1) The added compound does not bind the receptor. In this case, no conformational 
20 change in the receptor takes place and the majority of all heterotrimeric G proteins will 
remain bound to the receptor. The spatial proximity of G alpha and the GPCR leads to 
the reconstitution of NbM and CbM and the formation of spiit-ubiquitin. Since split- 
ubiquitin is formed, the transcriptional activator polypeptide will be cleaved off by ubiq- 
uitin-specific proteases and the transcriptional activator polypeptide will diffuse to the nu- 
25 cleus of the yeast cell. As a consequence, the URA3 (DSYDS1) or the FAR1 (DSYDS2) 
reporter of the yeast transformant is activated and will produce the Ura3 or Far1 gene 
product. The Ura3 gene product converts 5-FOA into a toxic metabolite that is used by 
the yeast cell and ultimately leads to cell death, whereas the Far1 gene product interferes 
with the cell cyle and arrests the yeast transformants. Consequently, the transformants 
30 do not grow in the selection medium. 
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(2) The added compound binds and activates the receptor. The conformational change 
following receptor activation leads to the dissociation of the G proteins from the receptor. 
In this case, the bait (receptor) and prey (G alpha or G beta) polypeptides will not be spa- 
tially close, reconstitution of NbM and CbM will not take place and split-ubiquitin will not 

5 be formed. Since split-ubiquitin is not formed, the transcriptional activator polypeptide will 
not be cleaved off by ubiquitin-specific proteases and the transcriptional activator poly- 
peptide will remain attached to the bait polypeptide at the membrane. As a consequence, 
the URA3 (DSYDS1) or FAR1 (DSYDS2) reporter of the yeast transformant is not acti- 
vated and no production of the Ura3 or Far1 gene product takes place. Consequently, the 

10 yeast transformant continues to grow. Growth of particular transformants in the selection 
media is then measured by measuring the optical density of the yeast culture. 

Modification 1 

In a modification of the procedure described above, the receptor can be expressed as an 
15 unfused polypeptide from any suitable yeast expression vector (such as p424GAL1 or 
P424CYC1, Mumberg et a/., 1995) and the bait and prey polypeptides are expressed 
from pDSdual-2. The construct pDSdual-2 is identical to pDSdual-1 described in the sec- 
tion „bait vectors", except that the CYC1 promoters have been replaced by CUP1 pro- 
moters. In this modification, the bait is any G alpha subunit, but preferably a promiscous 
20 alpha subunit such as human G alpha 16, and the prey is any G beta subunit. 

The expression, the verification of expression and the screening are carried out exactly 
as described above. In this case, binding of a ligand to the receptor induces dissociation 
of the G protein complex and therefore leads to the spatial separation of G alpha and G 
25 beta. Spatial separation of G alpha and G beta prevents formation of split-ubiquitin by 
NbM and CbM and prevents expression of URA3 (DSYDS1) or FAR1 (DSYDS2), leading 
to growth of the transformant in selective medium. 

Modification 2 

30 In another modification of the procedure described above, the receptor is expressed as 
an unfused polypeptide from any suitable yeast expression vector (such as p424GAL1 or 
p424CYC1, Mumberg et a/., 1995) and the bait and prey polypeptides are expressed 
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from pDSdual-2. The bait is adenylate cyclase and the prey is any G alpha subunit, but 
preferably a promiscous alpha subunit such as human G alpha 16,. For screening any 
suitable yeast reporter strain can be used, but preferably, L40, AH109 or PJ69-4A and 
most preferably, DSY1 or DSY2 is used. 

5 

The expression, the verification of expression and the screening are carried out exactly 
as described above, except that the selective medium is lacking leucine (to select for 
pDSdual-2, tryptophan (to select for the expression construct encoding the receptor) and 
adenine and histidine. The stringency of the screen may be varied by supplementing ei- 
10 ther adenine or histidine, as described in the section Jibrary screening procedure". 

The following outcomes are possible: 

(1) The added compound does not bind the receptor. In this case, no conformational 
15 change in the receptor takes place and the majority of all heterotrimeric G proteins will 
remain bound to the receptor. If G alpha remains bound to the receptor, ifr : r^not reach^f , 
adenylate cyclase and consequently, NbM and CbM are not. spatially ^clo^#m,no split-. 
ubiquitin is formed. Since no split-ubiquitin is formed, the transcriptional activator poly- 
peptide will not be cleaved off by ubiquitin-specific proteases and the transcriptional acti- 
20 vator polypeptide is unable to reach the nucleus of the yeast cell. As a consequence, the 
reporter genes of the yeast transformant are not activated and will not produce any re- 
porter protein. Therefore, the respective yeast transformant will be unable to grow in the 
selective medium. 

25 (2) The added compound binds and activates the receptor. The conformational change 
following receptor activation induces dissociation of G alpha from G beta and G gamma, 
followed by binding of G alpha to adenylate cyclase. The cloese spatial proximity of G 
alpha and adenylate cyclase brings NbM and CbM into close proximity, leading to the 
reconstitution of split-ubiquitin. Since split-ubiquitin is formed, the transcriptional activator 

30 polypeptide will be cleaved off by ubiquitin-specific proteases and the transcriptional acti- 
vator polypeptide will diffuse to the nucleus of the yeast cell. As a consequence, the re- 
porter genes of the yeast transformant are activated and will produce the reporter protein. 
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Therefore, the respective yeast transformant will be able to grow in the selective medium. 
Growth of particular transformants in the selection media is then measured by measuring 
the optical density of the yeast culture. 

5 Modification 3 

This modification is used to specifically select compounds that bind one GPCR but a re- 
lated GPCR. For example, GPCR 1 (the bait) may be the beta2 adrenergic receptor and 
GPCR 2 (the control) may be the betal adrenergic receptor. The use of modification 3 in 
screening will lead to the isolation of compounds that selectively activate the beta2 adre- 
10 nergic receptor but not the betal adrenergic receptor. The identification of novel com- 
pounds that show no unspecific cross-talk between GPCRs of the same subclass or even 
between unrelated GPCRs is of enormous importance for the pharmaceutical industry. 

In modification 3 the nucleic acid sequence encoding a G alpha subunit, which may be 
15 any G alpha subunit but which preferably is human G alpha 16, is inserted into the con- 
struct pDS-INbM such that a fusion of G alpha 1 0 un^NbM results. > m 

pDS-INbM is an integrative vector carrying the following expression cassette: the cas- 
sette contains a CYC1 promoter for low level expression in yeast, followed by a multiple 
20 cloning site containing multiple recognition sites for restriction endonucleases and se- 
quences for the in vivo cloning described in the section Jibrary screening procedure", 
followed by the sequence encoding NbM, followed by a CYC1 terminator. 

Alternatively, the cassette may contain a CUP1 promoter instead of a CYC1 promoter to 
25 allow the inducible expression of the fusion polypeptide. 

The backbone of the construct contains elements necessary for the integration of the 
construct into the yeast genome, such as a kanMX cassette and appropriate homology 
regions. Furthermore, the vector contains the kanamycine resistance cassette for selec- 
30 tion in E. co//, and the pUC origin of replication for propagation in E. coli. 



This vector is suitable for the low level or inducible expression of a NbM-fused polypetide. 
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The construct encoding adenylate cyclase and a G alpha subunit in pDS-ldual-1 is inte- 
grated into the genome of an appropriate reporter strain using standard methods (Burke 
et at., 2000), which may be any reporter strain but which preferably is DSYDS1 or most 
5 preferably DSYDS2. 

The nucleic acid sequence enoding the GPCR for which ligands are to be found (termed 
GPCR 1) is inserted into any of the bait vectors described in section 2, but preferably into 
pCUP1-CbM-DB42, which is identical to pCUP1-CbM-TDA, but where the nucleic acid 
10 sequence encoding VP16 has been replaced by the nucleic acid sequence encoding B42 
and where the LEU2 marker has been replaced by TRP1 marker. When transformed into 
yeast, this construct expresses a fusion polypeptide betwen GPCR 1 and LexM-B42. 

The nucleic acid sequence enoding the GPCR which must not bind ligands that bind to 
15 GPCR1 (termed GPCR 2) is inserted into any of the bait vectors described in section 2, 
but preferably into pC<£4.™4842 -or pCGA93B42. When transformed into yeast, this con^. 
struct expresses a fu^-^typeptide betwen GPCR 2 and Gal4 amino acids 1-74-B42. 

The yeast reporter strain carrying the integrated construct is transfomed with the two 
20 constructs encoding GPCR GPCR 2 baits and grown in selective medium lacking the 
amino acids tryptophan (to select for the construct encoding GPCR 1) and leucine (to 
select for the construct encoding GPCR 2). 

Yeast transformants are aliquoted into 96-well or 384-well plates and selective medium 
25 lacking the amino acids leucine, tryptophan and the compound adenine (to select against 
activation of GPCR 2) compound is added. Following the addition of the compound, cop- 
per is added to induce expression of the GPCRs and if DSYDS1 is used, 5-FOA is 
added. 

30 Selection is carried out as described above. 



The following outcomes are possible: 
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(1) The added compound does not bind the GPCR 1. In this case, no conformational 
change in the receptor takes place and the majority of all heterotrimeric G proteins will 
remain bound to the receptor. The spatial proximity of G alpha and the GPCR leads to 

5 the reconstitution of NbM and CbM and the formation of split-ubiquitin. Since split- 
ubiquitin is formed, LexA-B42 will be cleaved off by ubiquitin-specific proteases and will 
diffuse to the nucleus of the yeast cell. As a consequence, the URA3 (DSYDS1) or the 
FAR1 (DSYDS2) reporter of the yeast transformant is activated and will produce the Ura3 
or Far1 gene product. The Ura3 gene product converts 5-FOA into a toxic metabolite that 

10 is used by the yeast cell and ultimately leads to cell death, whereas the Far1 gene prod- 
uct interferes with the cell cyle and arrests the yeast transformants. Consequently, the 
transformants do not grow in the selection medium. 

(2) The added compound binds to GPCR1 and not to GPCR 2. The conformational 
15 change following receptor activation leads to the dissociation of the G proteins from 

^?CR 1. In this case, the bait (receptor) and prey (G alpha) polypeptides will not be,spa- ; 
i r>i;y close, reconstitution of NbM and CbM will not take place and split-ubiquitin will not 
be formed. Since split-ubiquitin is not formed, LexM-B42 will not be cleaved off by ubiq- 
uitin-specific proteases and will remain attached to the bait polypeptide at the membrane. 

20 As a consequence, the URA3 (DSYDS1) or FAR1 (DSYDS2) reporter of the yeast trans- 
formant is not activated and no production of the Ura3 or Far1 gene product takes place. 
Consequently, the yeast transformant continues to grow. Since GPCR 2 is not activated 
by the compound, G alpha-NbM remains bound to GPCR 2, the close spatial proximity of 
CbM and NbM leads to their association to form split-ubiquitin and the Gal-B42 polypep- 

25 tide is cleaved off and activates the ADE2 reporter. Therefore, the transformants survive 
selection in the selection medium lacking adenine. Growth of particular transformants in 
the selection media is then measured by measuring the optical density of the yeast cul- 
ture. 

30 (3) The added compound binds to GPCR 1 (with the outcome described in (2)) but also to 
GPCR 2. The conformational change in GPCR 2 leads to the dissociation of G alpha- 
NbM from GPCR 2. Since the ubiquitin-specific proteases do not recognize the CbM moi- 
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ety alone, the Gal-B42 polypeptide is not cleaved off. Consequently, the ADE2 reporter is 
not activated and the transformant does not grow in the selection medium. 

5 Examples 
Example 1 

MbY2H screen using the beta2 adrenergic receptor as a bait 

10 

Bait cloning 

The bait vector pCbM-TDA described in the section „bait vectors" was modified by inser- 
tion of the leader sequence from the yeast STE2 gene into the Xba I and Pst I sites. The 
15 sequence of the leader was exactly as described by (King et al M 1990). The coding se- 
quence of the human beta2 adrenergic receptor was amplified from human ger^^MA^.. 
(Promega Corporation, Madison, Wl, USA) and cloned into the vector pSte-C^|#:^^ -m. 

Verification of expression 

20 

Correct expression of the receptor was verified by western blotting using an antibody di- 
rected against the VP16 domain (Clontech). In the yeast strain L40 expressing the con- 
struct pBAR-CbM-TDA, a double band at 75 and 100 kDa was visible (Figure 9A). No 
bands were visible in control extracts of L40 transformed with a vector supplying the 
25 LEU2 resistance marker. The presence of the receptor in membranes of L40 was verified 
using a standard membrane fractionation method as described in the section „library 
screening procedure". As shown in Figure 9B, the receptor was only found in the mem- 
brane fraction, but not in the soluble fraction. 



30 Self-activation test 
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Self-activation of the receptor-LexA-VP16 fusion was assessed by testing growth of 
transformants on selective medium lacking the amino acids histidine, leucine and trypto- 
phane and containing 15 mM 3-AT and by assaying beta-galactosidase activity using a 
standard filter test. Transformants expressing the receptor showed no activity in the beta- 
5 galactosidase assay. 

Screening against a NbM-x library 

A screen was carried out using a human brain NbM-x library constructed by Life Tech- 
10 nologies (Carlsbad, CA, USA). Cotransformation and growth on selective plates was 
performed as described in the section Jibrary screening procedure". Selection yielded 
approximately 300 colonies. Following a beta-galactosidase filter assay on the primary 
selection plates, 96 colonies showing strong induction of beta-galactosidase activity were 
picked and restreaked on selective media. Following a second beta-galactosidase filter 
15 assay, 59 positive clones were processed for western blotting and plasmid isolation as 
described in the section Jibrary screening procedu^': Library plasmids were reintro^ 
duced into L40 together with either pBAR-CbM-TB^ m^ric - control construct pErbB3^ 
TDA, encoding the human ErbB3 receptor fused to CbM-LexM-VP16, and assayed for 
growth on selective plates and beta-galactosidase activity. Of 59 clones, 19 scored as 
20 bait-dependent, i.e. they interacted with the beta2 adrenergic receptor but not with the 
ErbB3 receptor. Plasmids from bait dependent clones were sequenced 

Example 2 

25 

Filter assay for the detection of p-galactosidase activity. 

1 . The yeast expressing Y-CbM-TDA are grown together with NbM-fusion proteins for two 
days at 30°C on sterile Whatman filters on drop-out agar plates lacking leucine and tryp- 
30 tophan. The drop-out-medium is used because cells tend to grow poorly in standard 
minimal medium. 
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2. Using forceps, the filter is transferred and dipped into liquid nitrogen for 3 min and al- 
lowed to thaw at room temperature. 

3. The filters are overlaid with 1.5% agarose in 0.1 M NaP0 4 -buffer (pH 7.0) containing 
5 0.4 mg/ml X-gal. 

4. The filters are incubated at 30°C for 0.4-24 hours. 



10 Example 3 

Quantitation of p-galactosidase activity. 

1. Yeast transformants expressing Y-CbM-TDA are inoculated together with NbM-fusion 
15 proteins into 3 ml of liquid drop-out medium lacking uracil, leucine and tryptophan. 

2. Incubation at 30°C until eu£#9S reach midlog phase (OD546 ~1.0) 

3. Cells are pelleted from 1 ml of culture, washed once in Z buffer, and resuspend in 300 
20 \i\ Z-buffer. 

4. 100 ^ cells are taken and lysed by 3 freeze/thaw cycles. 

5. 700 nl Z-buffer containing 0.27 % (v/v) p-mercaptoethanol and 160 fxl ONPG (4 mg/ml 
25 in Z-buffer) are added and incubated for 1-20 hours at 30°C. 

6. 400 jil 0.1 M NaC0 3 are added, the samples centrifuged, and the OD420IS measured. 



30 



7. The p-galactosidase activity is calculated using the formula: 

p-galactosidase units = 1 000 x OD 42 o/(OD 5 46 x min) 
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Example 4 

Western blot analysis of cells expressing Y-CbM-TDA together with NbM-fusions 

5 

1. Yeast cells expressing Y-CbM-TDA together with NbM-fusion proteins are grown at 
30°C to an OD 54 6 of 0.3-1.2 in drop-out liquid medium lacking leucine and tryptophan. 

2. The cells are pelleted and resuspended in 50 p.l 1.85 M NaOH per 3 OD units of cells, 
10 and incubated on ice for 1 0 min. 

3. The same volume of 50 % trichloroacetic acid is added, and the proteins are precipi- 
tated by centrifugation for 5 min. 

15 4. The pellet is resuspended in 50 [i\ of SDS-sample buffer containing 8 M urea. 

jil of 1 M Tris.base is added and the protein is dissolved at 37°C (heating to 95°G 
sometimes results in the clumping of membrane proteins). 

20 6. The samples are centrifuged for 2 min. and 10 pJ extract is used for SDS- 
PAGE/Western blotting analysis. 

7. The amount of protein loaded by Coomassie staining of the SDS-gels is verified. 

25 8. The membranes are probed with peroxidase-IgG at 1:5000 dilution. Protein A-fusion 
proteins are detected by enhanced chemiluminescence (Pierce of Amersham). 

Example 5 

30 

Interactions between heterologous proteins in yeast can be detected using the MbY2H 
system 
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To demonstrate the usefulness of the MbY2H system in detecting protein-protein interac- 
tions between two heterologous proteins situated at the membrane in yeast, two defined 
protein pairs were selected. In the first example, the homodimerization of presenilin-1 N- 
terminal fragments is demonstrated and serves as an example of an interaction between 
two integral membrane proteins. In the second case, the interaction between the receptor 
tyrosine kinase ErbB3 and its adaptor protein Nrdpl was chosen to demonstrate the in- 
teraction between an integral membrane protein and a soluble (cytosolic) protein. 

Definition of bait and prev vectors 

1 . pCMBVL The vector pCMBVI corresponds to the vector pCbM-TDA described on 
page 21, except that the sequence encoding the 3xFLAG epitope has been re- 
moved (Figure 13A). 

2. pAMBVL The vector pAMBVI is identical to pCMBVI , except that the weak CYC1 
promoter has been replaced by a strong ADH1 promoter (Figure 13B). 

3. pDSL-Nx. The vector pDSL-Nx corresponds to the vector p424NbM-X desc3&^% 
on page 37 (Figure 14 A). 

4. dADSL-Nx. The vector pADSL-Nx corresponds to the vector pNbM-HA-X de- 
scribed on page 36 (Figure 14 B). 

5. pDSL-xN. The vector pDSL-xN corresponds to the vector p424X-NbM described 

on page 37 (Figure 14 C). 

6. oADSL-xN. The vector pADSL-xN corresponds to the vector pX-HA-NbM de- 
scribed on page 36 (Figure 14 D). 

Homodimerization of presenilin-1 N-termin al fragments 

Presenilin-1 is a polytopic membrane protein with eight transmembrane domains. In its 
mature form, the protein is a dimer of an N-terminal (NTF) and a C-terminal (CTF) frag- 
ment which are generated from intact presenilin-1 by endoproteolytic cleavage in a cyto- 
solic loop located between transmembrane domains 6 and 7. The two fragments remain 
associated at a stoichometric ratio of 1:1 and form a very stable complex. Currently, the 
precise function of presenilin-1 is still under debate but it has been shown numerous 
times that the protein is an essential part of a multiprotein complex termed gamma- 
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secretase. The gamma-secretase complex is involved in proteolytic processing of a num- 
ber of transmembrane proteins, among them APP (amyloid precursor protein) and mem- 
bers of the Notch family (Esler and Wolfe, 2001). 

The homodimerization of NTFs has been demonstrated previously using the Split- 
ubiquitin method described by Stagljar and coworkers (Stagljar et al., 1998; Cervantes et 
al., 2001). Therefore this interaction was chosen to investigate whether it can also be 
detected using the novel vectors containing different promoters and the LexM moiety. 
Figure 10 A shows a schematic description of bait and prey in the MbY2H system. The 
interaction between NTF bait and NTF prey should lead to the reconstitution of split- 
ubiquitin from Cub and NubG and consequent cleavage of the peptide bond between Cub 
and LexM-VP16 (Rep in Figure 10 A). Transport of LexM-VP16 to the nucleus and its 
binding to LexA operators then activates the reporter genes, whose readout is measured 
either by growth on selective media or by assaying beta-galactosidase activity. Figure 10 
B shows the outcome of the experiment. Yeast reporter strain L40 was cotransformed 
with different baits and preys and transformed cells were plated on minimal medium se- h . 
lecting for the presence of both plasmids. Following &&*&\ *?< 30°C for 3 days, 10 colo- 
nies of each transformant were resuspended in 0.9% NaCI and aliquots were spotted 
onto minimal medium to select for protein-protein interactions. Growth on selective me- 
dium indicates that the bait and prey interact. Low level expression of bait and prey from 
a weak CYC1 promoter (pCMBV1-NTF and pDSL-NubG-NTF) does not allow for the de- 
tection of the interaction, whereas increasing the expression level of the bait (pAMBVI- 
Isitf and pDS L-NubG-NTF) leads only to modest growth on selective medium. Expres- 
sion of the prey at high levels (pADSL-NubG-NTF) is sufficient to detect the interaction 
both with low level and high level expression of the bait. Neither bait interacts with a non- 
cognate control prey expressed at high levels (pMBV1-Alg5-NubG). 

Interaction of ErbB3 with the C-ter minal part of NrdPl 

The receptor tyrosine kinase ErbB3 is a member of the family of Erb receptors, which 
also contains ErbB1 (also called EGFR, epidermal growth factor receptor), ErbB2 and 
ErbB4. They mediate diverse signals influencing essential cellular responses such as 
differentiation, proliferation and survival (Olayioye et al., 2000). Nrdpl is a ring finger 
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containing protein whose C-terminal domain has been found to interact with the cytosolic 
tail of ErbB3 in a yeast two-hybrid assay (Qiu and Goldberg, 2002). 

To assay the interaction between ErbB3 and Nrdpl bait and prey constructs were trans- 
5 formed into the yeast strain L40 as indicated in Figure 11, spotted onto selective plates 
and grown at 30°C for 3 days. Strong growth on selective medium lacking histidine is 
seen for ErbB3 and Nrdpl when expressed at high levels (Figure 1 1 a), whereas neither 
ErbB3 in combination with the control prey expressed from pADSL-Alg5 (Figure 11c) nor 
Nrdpl in combination with the control bait expressed from pMBV1-Alg5 (Figure 11 b) 
10 show any growth on selective medium. Again, this demonstrates that high level expres- 
sion of bait and prey in the MbY2H system allows the detection of an interaction between 
an integral membrane protein and its cytosolic binding partner. 

15 Example 6 

Self-activation levels of conste&& searing the wild type and mutated LexA sequences <g* . 

To demonstrate that screening for protein-protein interactions is not possible using the 
20 vectors described in Stagljar et ai (1998) due to the inherent self-activation of the bait- 
LexA-Cub-ProteinA-VP16 fusion proteins, self-activation levels of constructs bearing the 
wild type and the mutated LexA sequence were compared when cotransformed with non- 
cognate (non-interacting) preys. Coexpression of the beta2-adrenergic receptor bait in 
the vector pCAS (carrying the wild type LexA sequence) together with either a non- 
25 interacting control prey encoding the yeast protein Ost1 fused to NubG or a control prey 
encoding only NubG with a membrane anchor resulted in growth on selective medium 
and strong beta-galactosidase activity (Table 1, rows 1 and 2). In contrast, the beta2- 
adrenergic receptor bait expressed from the vector pCbM-TDA (carrying the LexM se- 
quence with the R157G mutation to decrease the strength of the nuclear localization sig- 
30 nal) did not grow on selective medium and did not show any beta-galactosidase activity 
when coexpressed with either of the two control preys (Table 1, rows 3 and 4). Thus, 
while the high background of the original bait vector described by Stagljar et ai (1998) 
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would result in an unacceptably high background of false positives in a MbY2H screen, 
the novel vectors described here decrease this background significantly to allow screen- 
ing of an integral membrane protein bait against cDNA or genomic libraries. 



ROW; 


Bait' ;J 


Prey 

• . 


Colonies on 
SD-HTL 


iacfovity 


1 


pCAS-bAR 


pOstl-NubG 


6 


very strong 


2 


pCAS-bAR 


pMP-NubG 


200 


very strong 


3 


pBAR-LM1 


pOstl-NubG 


0 


negative 


4 


PBAR-LM1 


pMP-NubG 


0 


negative 



5 



Table 1. Comparison of self-activation levels of wild type and mutated LexA bait 
constructs. The entire open reading frame of the beta2-adrenergic receptor (beta2-AR) 
was cloned into several bait vectors to yield the following constructs: (1) pCAS-bAR: 

10 beta2-AR inserted into the vector pCAS carrying wild type LexA. (2) pBAR-LM1: beta2- 
AR inserted into pCbM-TDA containing the R157G mutation in LexA to decrease the 
strength of nuclear localization signal. The following prey constructs were used: (1) 
pOstl-NubG: the entire open reading frame of the yeast protein Ost1 inserted into pX- 
HA-NbM. (2) pMP-NubG: a signal conferring fatty acid modification inserted into pX-HA- 

15 NbM. pursuit combinations of bait and prey constructs were cotransformed into the 
yeasi; "J L40 and growth was assayed on minimal medium lacking the amino acids 
trypiop^fev leucine and histidine. The bait construct containing the wild type LexA: 
showed activated both reporter genes in the absence of a valid protein-protein interac- 
tion, resulting in growth on SD-HTL and strong Beta-galactosidase activity. In contrast, an 

20 identical construct bearing the mutated LexA sequence did not show any activation of the 
reporter genes in the absence of a protein-protein interaction. 

Example 7 



25 MbY2H screen using a modified bait vector and the beta2-adrenergic receptor as a bait 



The screen was performed using a human brain NubG-x cDNA library. The beta2- 
adrenergic receptor was inserted into the bait vector pCbM-TDA carrying the LexM se- 
quence (LexA with the R157G mutation to decrease the strength of the nuclear localiza- 
30 tion signal). Library scale transformation and growth on selective medium was performed 
as described in the section "library screening procedure". Selection for histidine prototro- 
phy and beta-galactosidase activity yielded 122 colonies. When restreaked on selective 
medium, 103 of the 122 colonies showed reproducible histidine prototrophy and beta- 
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galactosidase activity. The positive 103 colonies were processed by Western blotting and 
plasmid isolation, and inserts were analyzed by 5'- sequencing. 

Eleven of the sequenced clones encoded ATPases (four proton transporting and seven 
calcium transporting ATPases). Seven clones encoded potassium voltage-gated chan- 

5 nels and five clones encoded NADH-dehydrogenase subunits. The fact that multiple 
clones were found that encode similar classes of proteins was taken as evidence that 
these clones indeed encode putative interactors of the beta2-adrenergic receptor. Fur- 
thermore, all clones encode integral membrane proteins. A bait-dependency test was 
carried out on a total of 16 clones, using the endogenous yeast membrane proteins Alg5 

10 and Ost1 as control. 13 clones were found to be bait dependent (Figure 12). 
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CLAIMS 

1 . A method for detecting an interaction between a first membrane bound protein or 
part thereof and a second protein or part thereof which is either membrane bound 
or soluble, the method comprising: 

(a) providing a host cell containing at least one detectable gene (reporter gene) 
having a binding site for a transcriptional activator, such that the detectable 
gene expresses a detectable product, preferably a protein, when the detect- 
able gene is transcriptionally activated; 

(b) providing, as part of a bait vector, a first chimeric gene capable of being ex- 
pressed in said host cell, the first chimeric gene coding inter alia for a first 
membrane protein or part thereof which gene is attached to the DNA- ^ 
^uence of a first module encoding inter alia a first protein sequence in- 
volved in intracellular protein degradation and a transcriptional activator, 
said first protein or part thereof to be tested whether it can interact with a 
second protein or part thereof; 

(c) providing, as part of a prey vector, a second chimeric gene capable of being 
expressed in said host cell, the second chimeric gene coding inter alia for a 
second protein or part thereof which is either membrane bound or soluble 
and which gene is attached to the DNA sequence of a second module en- 
coding inter alia a second protein sequence involved in intracellular protein 
in degradation; 

(d) introducing the bait vector and the prey vector into the host cell such that an 
interaction between the expressed first and second protein and/or their parts 
can take place, which interaction leads to an interaction of the first protein 
sequence of the first module and the second protein sequence of the sec- 
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ond module which interaction in turn leads to activation of an intracellular 
protease and proteolytic separation of the transcriptional activator, at least 
one of the bait vector and prey vector, preferably both, being suitable for 
being maintained episomally; 

(e) determining whether the detectable gene of the host cell has been activated 
by the transcriptional activator. 

2. The method according to claim 1 , wherein the host cell is a yeast, a bacterial or a 
mammalian cell. 

3. The method according to claim 2, wherein the host cell is a Saccharomyces pom- 
be cell or, more preferably, cells of the budding yeast Saccharomyces cerevisiae. 

4. The method according to any of claims 1 to 3, wherein the detectable gene is 
activated by a natural or artificial activator, preferably an activator comprising a 
short tagging module. 

5. The method according to any of claims 1 to 4, wherein the detectable gene is 
activated by the artificial transcriptional activator protein A-LexA-V16 (PLV). 

6. The method according to any of claims 1 to 5, wherein the first protein sequence 
comprises a C-terminal portion of ubiquitin (Cub) or a mutant thereof (CbM) and 
the second protein sequence comprises an N-terminal portion of ubiquitin (Nub) 
or a mutant thereof (NbM). 

7. - The method according to any of claims 1 to 6, wherein the DNA-sequence coding 

for the first membrane protein is selected from the group consisting of any bacte- 
rial membrane protein, any viral membrane protein, any oncogene-encoded 
membrane protein, any growth factor receptor or any eukaryotic membrane pro- 
tein, or parts thereof. 
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The method according to any one of the preceding claims wherein the second 
membrane protein or the soluble protein, or part thereof, is encoded by a library 
of plasmids. 

A kit for detecting binding between a first membrane bound protein or part thereof 
and a second protein or part thereof which is either membrane bound or soluble 
comprising: 

(a) a host cell containing at least one detectable gene (reporter gene) having a 
binding site for a transcriptional activator, such that the detectable gene ex- 
presses a detectable product, preferably a protein, when the detectable ge- 
ne is transcriptionally activated; 

(b) a first vector (bait) comprising a first site capable of receiving a first nucleic 
acid coding for a first membrane protein or part thereof such that when the 
first nucleic acid is inserted it becomes attached to the ( ;Kvsequence of a if 
first module encoding inter alia a first protein sequence Solved in intracel- 
lular protein degradation, the module further comprising a nucleic for a 
transcriptional activator; 

(d) a second vector (prey) comprising a second site capable of receiving a se- 
cond nucleic acid coding for a second membrane protein or a soluble pro- 
tein or part thereof such that when the second nucleic acid is inserted it be- 
comes attached to the DNA sequence of a second module encoding inter a- 
lia a second protein sequence involved in intracellular protein degradation; 
and optionally 

(e) a plasmid library encoding second proteins or parts thereof. 

The kit according to claim 9, wherein the host cell is a yeast, bacterial or mamma- 
lian cell. 
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1 1 . The kit according to claim 10, wherein the host cell is a yeast cell, preferably of 
Saccharomyces pombe and most preferably of the budding yeast Saccharomy- 
ces cerevisiae. 

12. The kit according to any of claims 9 to 1 1 , wherein the detectable gene can be 
activated by a natural or artificial activator, preferably an activator comprising a 
short tagging module. 

13. The kit according to any of claims 9 to 12, wherein the detectable gene can be 
activated by the artificial transcriptional activator Protein A-LexA-V16 (PLV). 

14. The kit according to any of claims 9 to 13, wherein the first protein sequence 
contains the C-terminal part of ubiquitin (Cub) or a mutant thereof and the second 
protein sequence contains the N-Terminal part of ubiquitin (Nub) or a mutant the- 
reof. 

15. The kit according to any of clal.m^ ^ 14, wherein the promoter is selected from r 
the group consisting of ADH promoter, CYC1 promoter or TEF1 promoter. 

16. The kit according to any of claims 9 to 15, wherein the DNA-sequence coding for 
the first membrane protein is derived from any bacterial membrane protein, any 
viral membrane protein, any oncogene-encoded membrane protein, any growth 
factor receptor or any eukaryotic membrane protein, or parts thereof. 

17. The kit according to any of claims 9 to 16, wherein the DNA-sequehce coding for 
the sectihd protein is contained in a library of plasmids. 

18. Vector useful as a bait vector in a method according to any of claims 1 to 8 or a 
kit according to any of claims 9 to 17 comprising the following elements: 

(1) a selection marker for propagation of the vector in E co//; 

(2) an origin of replication which allows propagation of the vector in E. co//; 
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(3) a further selection marker for propagation of the vector in yeast; 

(4) an origin of replication which allows propagation of the vector in yeast; and 

(5) an expression cassette comprising the following elements: 

(a) a promoter element conferring low level expression or inducible 
expression or high level expression; 

(b) a nucleic acid sequence encoding a leader; 

(c) a site capable of receiving a nucleic acid encoding a protein; 

(d) a nucleic acid sequence encoding the Oterminal part of yeast 
ubiquitin (Cub) or mutated yeast ubiquitin (CbM); 

(e) a nucleic acid sequence encoding a tag that allows detection of the 
fusion protein; 

(f) a nucleic acid sequence encoding a DNA binding protein; 

(g) a nucleic acid sequence encoding a transcriptional activator; and 

(h) a terminator sequence. 

1 9. The vector according to claim 1 8, wherein the selection marker according to 
element (1) comprises a marker selected from the ampicilline resistance gene, the 
kanamy . : v ^Distance gene or the chloramphenicol resistance gene. V | 

20. The vector according to any of claims 18 or 19, wherein the origin of replication 
according to element (2) is selected from a pUC based or pBR322 origin of 
replication. 

21 . The vector according to any of claims 18 to 20, wherein the selection marker 
according to element (3) is selected from any marker that is selectable in S. 
cerevisiae, preferably the LEU2 gene. 

22. The vector according to any of claims 1 8 to 21 , wherein the origin of replication 
according to element (4) is a low copy number origin, preferably selected from 
CEN and ARS origins. 



23. 



The vector according to any of claims 18 to 22, wherein the promoter according to 
element (5a) is selected from CYC1, GAL1, CUP1, ADH1 or TEFL 
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24. The vector according to any of claims 1 8 to 23, wherein the nucleic acid sequen- 
ce encoding a leader according to element (5b) is selected from a signal sequen- 
ce derived from a yeast integral membrane protein or a signal sequence which 
confers fatty acid modification preferably to the following peptide (N- 
MGCTLSAEDKPGGP-C). 

25. A vector according to any of claims 1 8 to 24, wherein the site for receiving a nuc- 
leic acid sequence according to feature (5c) is a multiple cloning site or a se- 
quence allowing the insertion of a further nucleic acid sequence by in vivo recom- 
bination. 

26. A vector according to any of claims 1 8 to 25, wherein the nucleic acid sequence 
according to element (5d) encodes the C-terminal part of yeast ubiquitin from a- 
mino acids 35 to 76. 

27. A vector according to any of claims 18 to 26, wherein the tag for detecting the 
expression of the polypeptide fusion is selected from epitopes of Gal4, LexA, B42 
or VP16 or 3x FLAG, 3x MYC or HA epitopes or the hemagglutinin epitopes. 

28. A vector according to any of claims 1 8 to 27, wherein the nucleic acid encoding 
the DNA binding protein according to element (5f) is selected from the sequence 
encoding the bacterial LexA protein, the sequence encoding the yeast Gal4 pro- 
tein, more preferably amino acids 1 to 93, most preferably the sequence encoding 
amino acids 1 to 74 of the Gal4 protein. 

29. A vector according to any of claims 18 to 28, wherein the transcriptional activator 
according to element (5g) is selected from the Herpes simplex virus protein VP16 
or more preferably the acidic domain B42. 
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A vector according to any of claims 18 to 29, wherein the terminator sequence 
according to element (5h) is selected from the CYC1 or ADH1 terminator sequen- 
ce. 

A vector according to any of claims 1 8 to 30, wherein the vector further contains a 
sequence coding for a first membrane bound protein or part thereof (the bait pro- 
tein). 

A vector suitable as a prey vector in a method according to any of claims 1 to 8 or 
a kit according to any of claims 9 to 17 comprising the following elements: 

(1 ) a selection marker for propagation of the vector in E coli; 

(2) an origin of replication which allows propagation of the vector in E. coli; 

(3) a selection marker for propagation of the vector in yeast; 

(4) an origin of replication which allows propagation of the vector in yeast; and 

(5) . an expression cassette comprising the following element?. 

(a) a promoter element conferring low level expressi© tf^ inducible ex- 
pression or high level expression; 

(b) an open reading frame encoding the N-terminal part of yeast ubiqui- 
tin (Nub) or a mutated ubiquitin (NbM); 

(c) a nucleic acid sequence encoding a tag which allows detection of the 
expressed polypeptide fusion; 

(d) a site capable of receiving a nucleic acid sequence encoding a pro- 
tein; and 

(e) a terminator sequence. 

The vector according to claim 32, wherein the selection marker according to ele- 
ment (1) is selected from the ampicilline resistance gene, the kanamycine re- 
sistance gene and the chloramphenicol resistance gene. 

The vector according to any of claims 32 or 33, wherein the origin of replication 
according to element (2) is selected from an origin such as pUC based or 
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pBR322 based origin or preferably an origin of replication allowing a high copy 
number of the plasmid when propagated in E. coli. 

35. The vector according to any of claims 32 to 34, wherein the selection marker ac- 
cording to element (3) is selected from a marker that is selectable in S. cerevisi- 
ae, preferably a sequence encoding the TRP1 gene. 

36. The vector according to any of claims 32 to 35, wherein the origin of replication 
according to element (4) is selected from an origin which allows propagation of 
the vector is S. cerevisiae, preferably the two micron based or CEN/ARS origin of 
replication or more preferably an origin of replication that allows a high copy 
number in S. cerevisiae, preferably the two micron origin of replication. 

37. A vector according to any of claims 32 to 36, wherein the promoter element ac- 
cording to element (5a) is selected from CYC1, GAL1, CUP1, ADH1 or TEFL 

38. A vector according to any of claims @ 37, wherein the open reading frame 
according to element (5b) encodes a part of the yeast ubiquitin comprising amino 
acids 1 to 37 of the wild-type or said part bearing an amino acid exchange at 
either position 3 or position 13 or both, wherein the amino acid replacing the ori- 
ginal amino acid is preferably selected from leu, val, ala or gly. 

39. A vector according to any of claims 32 to 38, wherein the tag is selected from the 
3x FLAG epitope, 3x MYC epitope, HA epitopes or an hemagglutinin epitope. 

40. The vector according to any of claims 32 to 39, wherejn the site capable of recei- 
ving a further sequence according to element (5d) is selected from a multiple clo- 
ning site or a sequence allowing insertion of the further sequence by in vivo re- 
combination. 

41 . A vector according to any of claims 32 to 40, wherein the terminator sequence is 
selected from yeast ADH1 orCYCI terminators. 
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42. The vector according to any of claims 32 to 41 , wherein the vector contains a 
further sequence encoding a protein or part thereof (the prey protein) which pro- 
tein is either membrane bound or soluble. 

43. Host cell containing the vectors and/or plasmids as defined in any of claims 9 to 
42. 



44. Use of the kit according to any of claims 9 to 1 7 or of the vector according to any 
of claims 18 to 41 in a process for detecting interaction between a first membrane 
bound protein or part thereof and a second protein or part thereof which is either 
membrane bound or soluble. 



45. Use of the host cell according to claim 43 in a method for detecting an interaction 
between a first membrane bound protein or part thereof and a second protein or 
part thereof wr»ch is either membrane bound or soluble. ... 

46. Use of the kit according to any of claims 9 to 1 7 or the vector according to any of 
claims 18 to 41 or the host cell of claim 43 in a screening process for identifying 
pharmaceutical drugs. 

47. A method for providing a compound capable of interfering with protein/protein interac- 

tion comprising the following screening steps: 

(1) providing a host cell according to claim 43, the bait and prey polypeptides 
being selected spch that they interact when expressed in the host cell; 

(2) incubating said host cell in the presence and absence of the compound(s) to 
be tested; 

(3) measuring the difference in reporter gene expression between the incubati- 
on containing the compound(s) to be tested and the incubation free of the 
compound(s) to be tested; and optionally 

(4) purifying or synthesizing the compound positively tested with the screen. 
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AMENDED CLAIMS 

[received by the International Bureau on 06 October 2003 (06.10.03); 
original claims 1-47 amended (9 pages)] 

1 . A method for detecting an interaction between a first membrane bound protein or 
part thereof and a second protein or part thereof which is either membrane bound 
or soluble, the method comprising: 



(a) providing a host cell containing at least one detectable gene (reporter 
gene) having a binding site for a transcriptional activator, such that the 
detectable gene expresses a detectable product, preferably a protein, 
when the detectable gene is transcriptionally activated; 

(b) (i) providing, as part of a first bait vector, a first chimeric gene under the 
control of a strong promoter, and 

(ii) providing, as part of a second bait vector, a first chimeric gene under 
the control of a weak promoter, 

said first chimeric gene being c&o- < .t ^ being expressed in said host cell 
and coding inter alia for a first memorane protein or part thereof which 
gene is attached to the DNA-sequence of a first module encoding inter alia 
a first protein sequence involved in intracellular protein degradation and a 
transcriptional activator, said first protein or part thereof to be tested 
whether it can interact with a second protein or part thereof; 

(c) providing, as part of a prey vector, a second chimeric gene under the 
control of a strong promoter capable of being expressed in said host cell, 
the second chimeric gene coding inter alia for a second protein or part 
thereof which is either membrane bound or soluble and which gene is 
attached to the DNA sequence of a second module encoding inter alia a 
second protein sequence involved in intracellular protein degradation; 

(d) introducing the first repectively the second bait vector and the prey vector 
into the host cell such that an interaction between the expressed first and 



AMENDED SHEET (ARTICLE 19) 



WO 03/083136 




PCT7EP03/03287 



second protein and/or their parts can take place, which interaction leads to 
an interaction of the first protein sequence of the first module and the 
second protein sequence of the second module which interaction in turn 
leads to activation of an intracellular protease and proteolytic separation of 
the transcriptional activator, at least one of the bait vector and prey vector, 
preferably both, being suitable for being maintained episomally; 

(e) determining whether the detectable gene of the host cell has been 
activated by the transcriptional activator. 

2. The method according to claim 1, wherein the host cell is a yeast, a bacterial or a 
mammalian cell. 

3. The method according to claim 2, wherein the host cell is a Saccharomyces 
pombe cell or, more preferably, cells of the budding yeast Saccharomyces 
cerevisiae. 

4. The method s ; v>ng to any of claims 1 to 3, wherein the detectable ger\e is 
activated by a natural or artificial activator, preferably an activator comprising a 
short tagging module. 

5. The method according to any of claims 1 to 4, wherein the detectable gene is by 
the artificial transcriptional activator protein A-LexA-V16 (PLV). 

6. The method according to any of claims 1 to 5, wherein the first protein sequence 
comprises a C-terminal portion of ubiquitin (Cub) or a mutant thereof (CbM) and 
the second protein sequence comprises an N-terminal portion of ubiquitin (Nub) 
or a mutant thereof (NbM). 

7. The method according to any of claims 1 to 6, wherein the DNA-sequence coding 
for the first membrane protein is selected from the group consisting of any bacte- 
rial membrane protein, any viral membrane protein, any oncogene-encoded 
membrane protein, any growth factor receptor or any eukaryotic membrane pro- 
tein, or parts thereof. 

8. The method according to any one of the preceding claims wherein the second 
membrane protein or the soluble protein, or part thereof, is encoded by a library: 
plasmids. 
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9. A kit for detecting binding between a first membrane bound protein or part thereof 
a second protein or part thereof which is either membrane bound or soluble 
comprising: 

(a) a host cell containing at least one detectable gene (reporter gene) having 
a binding site for a transcriptional activator, such that the detectable gene 
expresses a detectable product, preferably a protein, when the detectable 
gene is transcriptionally activated; 

(b) a first vector (bait) comprising a first site capable of receiving a first nucleic 
acid coding for a first membrane protein or part thereof such that when the 
first nucleic acid is inserted it becomes attached to the DNA sequence of a 
first module encoding infer alia a first protein sequence involved in 
intracellular protein degradation, the module further comprising a nucleic 
for a transcriptional activator; 

(d) a second vector (prey) comprising a second site capable of receiving a se- 
cond nucleic acid coding for a second membrane protein or a soluble pro- 
tein or part thereof such that when the second nucleic acid is inserted it 
becomes attached to the DNA sequence of a second module encoding 
inter alia a sequence protein sequence involved in intracellular protein 
degradation; and optionally 

(e) a plasmid library encoding second proteins or parts thereof. 

10. The kit according to claim 9, wherein the host cell is a yeast, bacterial or mamma- 
lian cell. 

11. The kit according to claim 10, wherein the host cell is a yeast cell, preferably of 
Saccharomyces pombe and most preferably of the budding yeast 
Saccharomyces cerevisiae. 

12. The kit according to any of claims 9 to 11, wherein the detectable gene can be 
activated by a natural or artificial activator, preferably an activator comprising a 
short tagging module. 

13. The kit according to any of claims 9 to 12, wherein the detectable gene can be 
activated by the artificial transcriptional activator protein A-LexA-V16 (PLV). 
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14. The kit according to any of claims 9 to 13, wherein the first protein sequence 
contains the C-terminal part of ubiquitin (Cub) or a mutant thereof and the second 
protein sequence contains the N-Terminal part of ubiquitin (Nub) or a mutant the- 
reof. 

15. The kit according to any of claims 9 to 14, wherein the chimeric gene is under the 

control of a promoter selected from the group consisting of ADH promoter, CYC1 
promoter or TEF1 promoter. 

16. The kit according to any of claims 9 to 15, wherein the DNA-sequence coding for 
the first membrane protein is derived from any bacterial membrane protein, any 
viral membrane protein, any oncogene-encoded membrane protein, any growth 
factor receptor or any eukaryotic membrane protein, or parts thereof. 

17. The kit according to any of claims 9 to 16 wherein the DNA-sequence coding for 

the second protein is contained in a library of plasmids. 

18. A vector useful as a bait vector in a method according to any of claims ^ a 
kit according to any of claims 9 to Comprising the following elements: 

(1 ) a selection marker for propagation of the vector in E. coli; 

(2) an origin of replication which allows propagation of the vector in E. coli; 

(3) a further selection marker for propagation of the vector in yeast; 

(4) an origin of replication which allows propagation of the vector in yeast; and 

(5) an expression cassette comprising the following elements: 

(a) a promoter element conferring low level expression or inducible 
expression or high level expression; 

(b) a nucleic acid sequence encoding a leader; 

(c) a site capable of receiving a nucleic acid encoding a protein; 
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(d) a nucleic acid sequence encoding the C-terminal part of yeast ubiquitin 
(Cub) or mutated yeast ubiquitin (CbM); 

(e) a nucleic acid sequence encoding a tag that allows detection of the fusion 
protein; 

(f) a nucleic acid sequence encoding a DNA binding protein; 

(g) a nucleic acid sequence encoding a transcriptional activator; and (h) a 
terminator sequence. 

19. The vector according to claim 18, wherein the selection marker according to 
element (1) comprises a marker selected from the ampicilline resistance gene, 
the kanamycine resistance gene or the chloramphenicol resistance gene. 

20. The vector according to any of claims 18 or 19, wherein the origin of replication 
according to element (2) is selected from a p.UC based or pBR322 origin of 
replication. 

21. The vector according to any of claims 18 to 20, wherein the selection marker 
according to element (3) is selected from any marker that is selectable in S. 
cerevisiae, preferably the LEU2 gene. 

22. The vector according to any of claims 18 to 21, wherein the origin of replication 
according to element (4) is a low copy number origin, preferably selected from 
CEN and ARS origins. 

23. The vector according to any of claims 1 8 to 22, wherein the promoter according 
to element (5a) is selected from CYC1 , GAL1 , CUP1 , ADH1 or TEF1 . 

24. The vector according to any of claims 18 to 23, wherein the nucleic acid sequence 

encoding a leader according to element (5b) is selected from a signal sequence 
derived from a yeast integral membrane protein or a signal sequence which 
confers fatty acid modification preferably to the following peptide (N- 
MGCTLSAEDKPGGP-C). 

25. The vector according to any of claims 18 to 24, wherein the site for receiving a 
nucleic acid sequence according to feature (5c) is a multiple cloning site or a se- 
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quence allowing the insertion of a further nucleic acid sequence by in vivo recom- 
bination. 

26. The vector according to any of claims 18 to 25, wherein the nucleic acid 
sequence to element (5d) encodes the C-tenminal part of yeast ubiquitin from 
amino acids 35 to 76. 

27. The vector according to any of claims 18 to 26, wherein the tag for detecting the: 

expression of the polypeptide fusion is selected from epitopes of Gal4, LexA, B42 
or VP16 or 3x FLAG, 3x MYC or HA epitopes or the hemagglutinin epitopes. 

28. The vector according to any of claims 18 to 27, wherein the nucleic acid encoding 
the DNA binding protein according to element (5f) is selected from the sequence 
encoding the bacterial LexA protein, the sequence encoding the yeast Gal4 pro- 
tein, more preferably amino acids 1 to 93, most preferably the sequence 
encoding amino acids 1 to 74 of the Gal4 protein. 

29. The vector according & any of claims 18 to 28, wherein the transcriptional 
activator according fe> element (5g) is selected from the Herpes simplex virus 
protein VP16 or more preferably the acidic domain B42. 

30. The vector according to any of claims 18 to 29, wherein the terminator sequence 
according to element (5h) is selected from the CYC1 or ADHI terminator 
sequence 

31. The vector according to any of claims 18 to 30, wherein the vector further 
contains a coding for a first membrane bound protein or part thereof (the bait pro- 
tein). 

32. A vector suitable as a prey vector in a method according to any of claims 1 to 8 or 
a kit according to any of claims 9 to 17 comprising the following elements: 

(1 ) a selection marker for propagation of the vector in E. coli; 

(2) an origin of replication which allows propagation of the vector in E. coli, 

(3) a selection marker for propagation of the vector in yeast; 
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(4) an origin of replication which allows propagation of the vector in yeast; and 

(5) an expression cassette comprising the following elements: 

(a) a promoter element conferring low level expression or inducible ex- 
pression or high level expression; 

(b) an open reading frame encoding the N-terminal part of yeast ubiquitin 
(Nub) or a mutated ubiquitin (NbM); 

(c) a nucleic acid sequence encoding a tag which allows detection of the 
expressed polypeptide fusion; 

(d) a site capable of receiving a nucleic acid sequence encoding a protein; 
and 

(e) a terminator sequence. 

3S. The vector according to claim 32, wherein the selection marker according to ele- 
ment (1) is selected from the ampicilline resistance gene, the kanamycine re- 
sistance gene and the chloramphenicol resistance gene. 

34. The vector according to any of claims 32 or 33, wherein the origin of replication 
according to element (2) is selected from an origin such as pUC based or 
pBR322 based origin or preferably an origin of replication allowing a high copy 
number of the plasmid when propagated in E. coli. 

35. The vector according to any of claims 32 to 34, wherein the selection marker ac- 
cording to element (3) is selected from a marker that is selectable in S. 
cerevisiae, preferably a sequence encoding the TRP1 gene. 

36. The vector according to any of claims 32 to 35, wherein the origin of replication 
according to element (4) is selected from an origin which allows propagation of 
the vector is S. cerevisiae, preferably the two micron based or CEN/ARS origin of 
replication or more preferably an origin of replication that allows a high copy 
number in S. cerevisiae, preferably the two micron origin of replication. 
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37. The vector according to any of claims 32 to 36, wherein the promoter element ac- 
cording to element (5a) is selected from CYC1, GAL1, CUP1, ADH1 or TEF-I. 

38. The vector according to any of claims 32 to 37, wherein the open reading frame 
according to element (5b) encodes a part of the yeast ubiquitin comprising amino 
acids 1 to 37 of the wild-type or said part bearing an amino acid exchange at 
either position 3 or position 13 or both, wherein the amino acid replacing the ori- 
ginal amino acid is preferably selected from leu, val, ala or gly. 

39. The vector according to any of claims 32 to 38, wherein the tag is selected from 
the 3x FLAG epitope, 3x MYC epitope, HA epitopes or an hemagglutinin epitope. 

40. The vector according to any of claims 32 to 39, wherein the site capable of 
receiving a further sequence according to element (5d) is selected from a multiple 
cloning site or a sequence allowing insertion of the further sequence by in vivo re- 
combination. 

41. The vector according to any of claims 32 to 40, wherein the terminator sect#ti&$*- 
is selected from yeast ADH1 or CYC1 terminators. 

42. The vector according to any of claims 32 to 41, wherein the vector contains a 
further sequence encoding a protein or part thereof (the prey protein) which pro- 
tein is either membrane bound or soluble. 

43. Host cell containing the vectors and/or plasmids as defined in any of claims 9 to 
42. 

44. Use of the kit according to any of claims 9 to 17 or of the vector according to any 
of claims 18 to 41 in a process for detecting interaction between a first membrane 
bound protein or part thereof and a second protein or part thereof which is either 
membrane bound or soluble. 

45. Use of the host cell according to claim 43 in a method for detecting an interaction 
between a first membrane bound protein or part thereof and a second protein or 
part thereof which is either membrane bound or soluble. 
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46. Use of the kit according to any of claims 9 to 17 or the vector according to any of 
claims 18 to 41 or the host cell of claim 43 in a screening process for identifying 
pharmaceutical drugs. 

47. A method for providing a compound capable of interfering with protein/protein 
interaction comprising the following screening steps: 

(1 ) providing a host cell according to claim 43, the bait and prey polypeptides; 
being; selected such that they interact when expressed. 

(2) incubating said host cell in the presence and absence of the compound(s) 
to be tested; 

(3) measuring the difference in reporter gene expression belween the 
incubation containing the compound(s) to be tested and the incubation 
free of the compound(s) to be tested; and optionally 

(4) purifying or synthesizing the compound poHr *:•>• tested with the 
screen. 
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